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ABSTRACT 



We use scattering theory, simulations, and empirical constraints on 
interstellar scintillations to discuss the intermittency of radio signals from 
extraterrestrial intelligence (ETI). The number of ETI sources in the Galaxy 
has a direct influence on the expected dynamic range of fluxes in a survey, 
through inverse square-law effects and, equally importantly, by the number 
of independent statistical trials made on modulations caused by interstellar 
scintillations. We demonstrate that scintillations are very likely to allow 
initial detections of narrowband signals, while making redetections extremely 
improbable, a result that follows from the skewed, exponential distribution of 
the modulation. This conclusion holds for relatively distant sources but does 
not apply to radio SETI toward nearby stars 100 pc). 

Recent SETI has found non-repeating, narrowband events that are largely 
unexplained. We consider three models in order to assess these events and 
to analyze large surveys in general: (I) Radiometer noise fluctuations; (II) 
A population of constant Galactic sources which undergo deep fading and 
amplification due to interstellar scintillation, consistent with ETI transmissions; 
and (III) Real, transient signals (or hardware errors) of either terrestrial or 
extraterrestrial origin. 

We derive likelihood and Bayesian tests of the models for individual 
events and globally on entire surveys. Applying them to The Planetary 
Society/Harvard META data, we find that Models II and III are both highly 
preferred to Model I, but that Models II and III are about equally likely. In 
the context of Model II, the likelihood analysis indicates that candidate events 
above threshold (~ 32cr) are combinations of large amplitude noise fluctuations 
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and scintillation gains, making it highly probable that events seen once will only 
very rarely be seen again. Ruling oTit Model II in favor of Model III is difficult 
— to do so, many more reobservations (e.g., thousands) are needed than were 
conducted in META (hundreds) or the reobservation threshold must be much 
lower than was used in META. We cannot, therefore, rule out the possibility 
that META events are real, intrinsically steady ETI signals. 

Our formalism can be used to analyze any SETI program. We estimate the 
number of reobservations required to rule out Model II in favor of Model III, 
taking into account that reobservations made promptly sample the same 
scintillation gain as in the original detection, while delayed reobservations 
sample a decorrelated scintillation modulation. The required number is a strong 
function of the thresholds used in the original survey and in reobservations. 

We assess optimal methods for applying statistical tests in future SETI 
programs that use multiple site and multiple beam observations as well as 
single site observations. We recommend that results be recorded on many more 
events than have been made to date. In particular, we suggest that surveys 
use thresholds that are far below the false-alarm threshold that is usually set 
to yield a small number of noise-induced "detections" in a massive survey. 
Instead, large numbers of events should be recorded in order to (1) demonstrate 
that background noise conforms to the distribution expected for it; and (2) 
investigate departures from the expected noise distribution as due to interference 
or to celestial signals. In this way, celestial signals can be investigated at levels 
much smaller than the false-alarm threshold. The threshold level for archiving 
candidate intensities and their corresponding sky positions is best defined in 
terms of the recording and computational technology that is available at a cost 
commensurate with other survey costs. 



1. INTRODUCTION 

The commonly anticipated signal in radio searches for extraterrestrial intelligence 
(SETI) is a narrowband, slowly modulated signal of sustained duration. For example, 
bandwidths less than 1 Hz and signal durations of days and longer are often hypothesized. 
These signal characteristics are closely related to the traditional requirement in science for 
repeatability of an experimental or observational result as well as to the hope that members 
of one class of ETI signal — those that are targeted explicitly to us — will sustain themselves 
long enough to establish repeatability. 
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By contrast with such an ideahzed type of signal, it is easy to imagine other kinds of 
received signals that depart significantly from the idealization, particularly with regard to 
duration and amplitude stability. Moreover, several recent searches have found signals that 
are narrowband, do not match the characteristics of known, interfering signals of terrestrial 
or solar system origin, and yet, do not repeat when follow-up observations of the relevant 
sky positions are performed. The META project ([Horowitz fc Sagan 1993| ; hereafter HS93) 



found several dozen such signals at 1420 and 2840 MHz. Similarly, SERENDIP HI has 
found ^ 100 transient candidate events at 430 MHz (Bowyer, Werthimer, & Donnelly 1994). 
An older example of this kind is the "WOW!" signal found in the Ohio State SETI program 
at 1420 MHz ( [Dixon 1985|) . For META the characteristic signal duration is no more than 



~ 1 minute and all reobservation attempts — ranging from minutes to years after receipt of 
the original candidate signal — have failed. Intense scrutiny of the WOW sky position (e.g., 
Gray 199"5[ ) has also failed to redetect a signal. 



A feature common to all SETI programs is that, by virtue of the multidimensional 
search space (direction, frequency, duration, etc.), large numbers of independent statistical 
tests are performed, typically ~ 10^^-10^^. The aim is to separate noise, interference, and 
candidate real signals. This winnowing process is based on the rigorously known properties 
of background noise and on an ad hoc expert system designed to reject terrestrial signals 
which fail sidereal and doppler requirements that bona fide celestial signals must satisfy. As 
a result of the large number of trials, even pure noise can produce large amplitudes that are 
statistically highly improbable in a single trial. Similarly, unusual interference, whether of 
natural or artificial origin, that fails to be identified by rejection algorithms can be recorded 
as candidate signals. 

In this paper, we address signal intermittency in SETI programs. Our goals are to 
1) develop the statistical apparatus for interpreting transient signals; 2) assess surveys, 
particularly META (HS93), that have yielded nonconfirmed events; 3) prescribe optimal 
methods for reporting the results of ongoing SETI programs; and 4) improve the search 
protocols of future SETI programs. 

A short list of possible explanations for intermittency includes: 

1. Noise in the receiver electronics, including thermal noise, cosmic ray induced events, 
and hardware failures; 

2. Radio frequency interference (RFI) whose origin is terrestrial, from Earth orbit, or 
from interplanetary spacecraft; 



3. Natural, extrinsic modulation of narrowband astrophysical sources such as that caused 
by interstellar (radio) scintillation (ISS) or gravitational lensing; 
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4. Natural, extrinsic modulation of ETI sources; and 

5. Intrinsic intermittency at the source of an ETI signal due to natural causes, such as 
planetary rotation or the nature of the transmission (e.g., planetary radar) or, for 
deliberate reasons, to frustrate detection and decryption by non-target civihzations. 

Causes (2) and (5) are not dissimilar in their transmission properties though their sidereal 
and doppler properties may allow them to be distinguished. In this paper, we use a 
Bayesian likelihood analysis as the apparatus for comparing cause (4) with several of the 
other, plausible explanations for signal intermittancy. 

Our focus on cause (4), with particular emphasis on ISS, is motivated by two 
considerations. First, it is the simplest possible explanation for the transient behavior of 
signals that perhaps otherwise would be steady. Second, ISS is important at centimeter 
wavelengths ( [Rickett 1990 ) that are commonly used in searches for ETI. Centimetric 



wavelengths are often favored because background noise levels are minimized relative 
to other wavelength bands and it is argued that spectral lines from HI and OH in this 
"waterhole" band serve as signpost or "magic" frequencies at which to transmit and receive 
( piiver 1973| ). Most importantly, however, we find that ISS by itself is sufficient to explain 
the lack of confirmation of any ETI candidate signals in surveys conducted to date. 

We consider ISS at waterhole frequencies using the framework of Cordes & Lazio (1991) 
and a recent model for Galactic free electrons by Taylor & Cordes (1993). We show that 
the properties of scintillations expected from a Galactic population of ETI sources are 
intimately related to the number of such sources, if quasi-uniformly distributed, thought to 
exist in the Galaxy. This follows because the time scale and characteristic bandwidth are 
strongly dependent on the distance of the source. In addition, the maximum increase in 
amplitude due to scintillation expected for any source is related to the number of statistical 
trials made and is therefore also related to the number of sources in the Galaxy. 

In previous papers ( [Cordes fc Lazio 199 1| ; [Cordes fc Lazio 1993| ; [Lazio fc Cordes 1998[ , 
hereafter CL91, CL93, & LC98, respectively), we have emphasized the need for multiple 
observations of the sky in order to combat the effects of interstellar scintillations. In 
particular, we have found that the optimal number of observations of a specific direction 
is a few and that these can restore some of the detection probability that would be lost 
otherwise in a single pass on the sky. Here, we focus on reobservations of an initial candidate 
detection and find a seemingly contradictory result: namely, that to rule out the reality of 
the candidate signal, it takes hundreds, if not thousands, of reobservations to do so. 

In §0 we summarize the effects scintillations impose on celestial signals from compact 
sources. In §|] we discuss the distribution of a Galactic population of extraterrestrial 
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civilizations and, for simplicity, we assume that the Galactic population is a set of "standard 
candle" transmitters. In §^ we summarize some recent SETI programs and the candidates 
they detected. Basic issues underlying an interpretation of survey results are outlined 
qualitatively in §^ while in §^ we introduce the likelihood functions that form the basis of 
our quantitative analysis. In §0 we analyze individual candidate events assuming that any 
multiple observations occur at times such that interstellar scintillations are uncorrelated; 
correlated scintillations are treated in In §P we extend our likelihood analysis to surveys 
as a whole. 



We extend our analysis to two-station SETI in §|T0| where we show that many of 



the challenges in verifying scintillating signals at a single site persist for simultaneous 



observations made at a pair of sites. We propose an alternative detection scheme in §11 



that uses all of the data in a survey and is much more sensitive than a signal-to-noise ratio 



limited survey. Our analyses are distilled into recommendations for future SETI in [|T^ and 
in §|13| we present our conclusions. 

In Appendix ^ we define symbols used throughout our analysis and Appendix |B] 
discusses scintillating source properties, including detection probabilities at single and 
multiple sites. 



2. INTERSTELLAR SCINTILLATIONS (ISS) 

In CL91, CL93, & LC98, we have described the impact upon source detection 
procedures resulting from radio waves propagating through a random, phase- changing 
medium, such as the ionized interstellar medium (ISM) or the interplanetary medium (IPM). 
The reader is directed to those papers for details on the salient features of scintillations 
(also see [Rickett 1990|) . Here we summarize scintillation phenomena that are relevant to 
our analysis. 



2.1. Brief Summary 

The scattering regime in which we are most interested is that which produces saturated 
intensity scintillations. In this strong scattering case, intensity variations branch into two 
forms ( [Rickett 1990|) : fast, diffractive scintillations (DISS) and slow, refractive scintillations 
(KISS). DISS is manifested as 100% variations of the intensity in both time and frequency. 
The characteristic time and frequency scales are discussed below. RISS typically shows 
10-20% variation on times scales of days to months and is correlated over a wide range of 
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frequency. At centimeter wavelengths, most of the Galaxy is seen in the strong scattering 
regime. For sources near the Sun, e.g., < 100 pc at 1 GHz, weak scintillations occur. A 
transition region extending to about 500 pc at 1 GHz produces scintillations with greater 
than 100% intensity variations (c.f. LC98; Rickett 1990 ). Consequently, in the remainder 



of this paper, we shall concentrate on the impact of strong scattering on SETI surveys done 
to date. Also, we discuss only the effects of DISS because modulations from it are orders of 
magnitude larger than those from RISS in large-scale SETI. 

While our interest here is primarily on intensity variations from interstellar scattering, 
phase fluctuations induced by the medium may also manifest themselves as Doppler shifts 
or frequency wandering of narrow spectral lines (CL91). In SETI programs optimized for 
finding narrowband signals, follow-up observations on candidate signals need to account 
for the possibility that signals drift or shift in frequency on a variety of time scales. 
Drifts may be caused by interstellar scattering, interplanetary scattering in the solar wind, 
and, presumably, in the stellar wind of the transmitting civilization's host star. These 
drifts are much less than the Doppler shifts caused by the spin or orbital motion of a 
planet. Nonetheless, they must be taken into account when making follow-up observations 
of candidate signals because they can still span several to many frequency bins of a 
narrow-band spectrometer. 



2.2. Characteristic Time and Frequency Scales 

The intensity of a scintillating source is correlated on characteristic time and frequency 
scales: the scintillation time, At^, and the scintillation bandwidth, Az/j. For the vast 
majority of lines of sight, the scintillation bandwidth exceeds the anticipated intrinsic 
widths of deliberately transmitted lines 1 Hz). In addition, broadening of the line by 
propagation effects is much less than 1 Hz for most lines of sight. Therefore, assuming 
intrinsic line widths are not substantially less than 1 Hz, the line shape is essentially 
unaffected by scattering while the line amplitude is modulated for most lines of sight ( Prake 
fc Helou 19771 ; CL91). 

When scintillations are saturated and the line shape undergoes negligible changes, 
the only astrophysically variable parameter is the scintillation time scale, At^ oc z/^'^/V^ 
where V± is a characteristic transverse speed that is a combination of the velocities of the 
source, observer, and medium. The time scale also depends on distance, with more distant 
sources having shorter time scale scintillations. For a medium with homogeneous statistics, 
Atd oc D~^'^ (Cordes & Lazio 1991), but the dependence is more complex for a realistic 
model of the Galaxy, as discussed below. The time scale has been measured to be seconds 



- 7- 



to hours for pulsars ( Cordes 1986 ), whose transverse velocities dominate the relevant V± 



and are in the range of 30 to 2000 km s ^ ( [Harrison fc Lyne 1993| ; [Cordes, Romani~S^ 



Lundgren 1993[ ; [Lyne fc Lorimer 1994[ ). By comparison, the motions of main sequence stars 



in the Galaxy and the orbital motions of their planetary companions are much lower than 
the average pulsar speed. Therefore, we expect that an ETI source at the same distance as 
a given pulsar will show a characteristic scintillation time scale that is much longer than 
that seen from the pulsar, by typically an order of magnitude. We point out that differential 
galactic rotation does not play a large role because the scattering material also participates 
in the rotation ([Cordes 1986| ). We also emphasize that, as the scintillation time is frequency 
dependent, an assessment of the role of scintillations must take the observation frequency 
into account. 

Through modeling of the free electron density in the Galaxy, it is possible to estimate 
the scintillation time scale for ETI sources. The most recent model for the free electron 
density and its fluctuations ([Taylor fc Cordes 1993[) incorporates all available data on pulsar 
distances and dispersion measures, and radio scattering data on pulsars. Galactic maser 
sources, and extragalactic sources viewed through the ISM. 

Figure [I| shows contours of the scintillation time Atd for directions in the plane of the 
Galaxy (i.e.. Galactic latitude 6 = 0). We have assumed a transverse speed Vj_ = 10 km s~^ 
for all ETI sources and an observation frequency of 1.42 GHz. Given the range of possible 
orbital motions and motions in the Galaxy, the effective velocity could vary by a factor of 
two or more, either lower or higher. It is evident that the scintillation time scale is of order 
minutes only for the most distant regions of the Galaxy. For most directions, it is typically 
hours. 

The scintillation time scale, therefore, significantly exceeds the typical time spent 
on a given sky position in SETI unless all sources are in the Galactic plane at large 
distances. Consequently, one may consider any putative (intrinsically steady) ETI signal 
to be constant over the observation time, though scintillations may have modulated it 
significantly upward or downward in apparent strength relative to its long term mean. 
Prompt repeat observations of a candidate signal made sooner than one diffraction time 
scale after an initial, tentative detection will "see" the same signal strength. Conversely, 
repeated observations made at times when the scintillation modulations are mutually 
independent will detect a large range of signal strengths that follows the distribution 
of scintillation modulations. For saturated scintillations, this distribution is a one-sided 
exponential function. 

The reader may think that Fig. |l| can be used to dismiss immediately candidate signals 
in META as being due to RFI, since they are not redetected immediately after their initial 
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detections. This is not the case. As we show below, candidate signals can be detected 
in META-like surveys as a combination of noise fluctuations and scintillations of celestial 
signals. The noise decorrelates immediately, so even a persistent source can disappear into 
the noise after an initial detection. 

Figure |l] also shows the scintillation bandwidth, Az/^, at 1.42 GHz. The scintillation 
bandwidth is found to be much greater than the hypothesized signal bandwidth 1 Hz) of 
deliberate ETI signals for all lines of sight through the Galaxy. The diffraction bandwidth 
scales with frequency as Au^ oc u'^''^. 

The reciprocals of the scintillation time scale and bandwidth are also of potential 
interest, yielding the bandwidth of spectral broadening and the pulse-broadening time, 
respectively, as Aush = (2vrAtd)~^ and = (27rAz/d)~^ (e.g., [Lovelace 19701 ; CL91, CL93). 
Using values for the scintillation time scale Ata from Fig. ^ it is clear that spectral 
broadening <C 1 Hz at 1.42 GHz. This confirms our previous statement that line shapes 
are essentially unaltered by scattering if they are intrinsically wider than the spectral 
broadening from scattering. 



2.3. Detection Probabilities 

Intensity statistics are discussed in Appendix |B[ There we distinguish variations of the 
intensity over short time scales, when the scintillation modulation is constant, from those 
over long times when the modulation varies according to its exponential distribution (in 
strong scattering). In the former case the intensity probability density function (pdf) is 
that for a phasor of constant amplitude gS combined with noise, where g is the scintillation 
"gain" and S is the source intensity in the absence of scintillations. The pdf fi{r,gS) is 
given in Eq. p^ , with S — > gS. In the second case, where g varies over its sample space, the 
pdf, /7,scint(-^; S"), is a simple, one-sided exponential with mean (I) = S + {N), where S is 
assumed to be constant and is radiometer noise. 

Each pdf implies a detection probability that the intensity exceeds a specified threshold 
intensity, 1^. We call these Pd(-^T; S) and -Pd,scint(-^T; '5'), respectively, and define them in 
Eq. (p8l)-(p9|). When there is no signal, just noise, S = 0, the detection probability is 
the false-alarm probability. In Fig. ^ we show detection probabilities with and without 
scintillations for the case where the false-alarm probability is 10~^^, corresponding to a 
threshold It{N) = log 10^^^ = 27.6. Recent SETI programs do, in fact, achieve conformance 
to exponential statistics up to large thresholds such as this so long as terrestrial RFI is 
removed (cf. Figure 4 of HS93; [Levin et al. 1995| ). 
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We reiterate a point discussed at length in our earlier papers: In the absence of 
scintillations, a source is essentially undetectable if S" < It, i.e., P{I > It) ~ 0. However, 
by the multiplicative nature of the scintillation gain, otherwise undetectable sources can 
be modulated above threshold. Since g is drawn from an exponential distribution, it can 
reach large values that result in a detection probability, Pd{lT, gS), significantly in excess of 
its value in the absence of scintillations, Pd{lT,S). Modulations which increase the signal 
strength are not the norm, though, and are in fact less likely than the converse. In contrast 
to our earlier papers (CL91, CL93, LC98), where we considered the probability of detecting 
a source at least once, in a survey, say, here we consider the probability of detecting a 
source at least twice, once in a survey and then again in a set of reobservations. 



2.4. Second-order Statistics 

Appendix |B| also presents results on second-order statistics that we use in discussing 
correlations in scintillation gain between several observations of the same source. 
Of particular interest is the conditional probability that a signal is detected in a 
second observation given that the intensity had the value Ii in an initial "detection," 
P2d{lT\h', S, Pg[At]). Here At is the time interval between observations and Pg is the 
autocorrelation function of the scintillation gain, normalized to Pg{0) = 1. This function is 
gaussian-like in form and has a width equal to the characteristic (diffractive) scintillation 
time. Aid. 

In Fig. ^ we show the conditional probability plotted against the correlation coefficient 
Pg G [0,1] for several values of the normalized source strength ( = S/{N), threshold 
?7t = It/{N), and intensity rji = Ii/{N) for an initial detection. As we discuss in § p!0| . 



observations made at two different sites may also be analyzed using Fig. ^. Like two 
temporally-spaced observations, the scintillation gain is partially correlated according to a 
spatial correlation function, ps. 

Figure ^ illustrates the importance of the combined effects of noise and scintillation. 
For values typical of META, rji = 32 and t^t = 20, the probability P2d is less than unity for 
a nominally detectable signal, ( = 32, even for perfectly correlated scintillation gains. If we 
suppose that the initial detection, rji, occurred because of the scintillation amplification of 
a weaker signal, e.g., ( < 16, the chances of not detecting the signal during reobservations 
are even greater. Combinations of ( and p can produce P2d ^ 0.1 in some cases. 



We will demonstrate that, in large scale surveys involving large numbers of statistical 
trials, threshold crossings (initial "detections") will occur through the combined effects 
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of noise and scintillating phasors (in the case where ETI signals exist!). Because of this 
"scintillation-noise conspiracy," confirmations of detected signals are difficult, owing to the 
rapid decorrelation of the noise on short time scales. Thus, for large surveys with large 
detection thresholds, the probability of redetection is small and, therefore, the absence of 
confirming redetections is insufficient evidence for concluding the absence of a possibly 
steady ETI signal. Similarly, simultaneous observations at two sites, where radiometer 
noise is uncorrelated, allow detection at one site and nondetection at the other with large 
probability. 

Qualitatively, noise by itself — without aid from scintillation — can also assist survey 
detection while inhibiting redetection. But quantitatively, for thresholds, candidate 
numbers, and numbers of reobservation trials like those in META, it is difficult to explain 
noise-assisted survey detections combined with the failure of all reobservations. Evidently, 
scintillations are needed to explain both. The reason lies in the difference in shapes of the 
detection probability curves in Figure | for intensity ratios S/It ^ 1. These, in turn, arise 
because, even though noise and scintillations both have one-sided exponential pdf's, they 
act asymmetrically because scintillations act multiplicativly on the true signal, while noise 
is additive. 



2.5. Scintillations as a Reality Check on Candidate Sources 

If a candidate signal is also detected in the reobservation phase of a SETI program, 
scintillations can serve to confirm that the candidate signal is in fact coming from beyond 
the solar system (cf. HS93). The strength of a signal from a candidate source should vary 
from observation to observation in a manner consistent with the exponential distribution, 
provided the observations are separated by more than a scintillation time and, of course, 
that scintillations are strong. 

In CL91, we alluded to another scintillation phenomenon, spectral wandering. If 
observed over time periods shorter than the scintillation time, the centroid of a narrow-band 
signal will show random but correlated shifts in frequency in addition to any Doppler shifts. 
If the signal is averaged over times much larger than the scintillation time, the signal will be 
spectrally broadened, i.e., the spectral shape of the signal will converge to a Gaussian- like 
shape with width Az/sb ~ l/At^ (CL91). 

Expressions are presented in CL91 for Az/gb as a function of distance and frequency 
for both weak and strong scintillation regimes. For Galactic sources, we expect frequency 
wandering to be roughly the reciprocal of the scintillation time scale. Reference to Fig. |1| 
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shows that wandering will be at the level of 10~^ to 0.1 Hz. There are some directions for 
which the level of scattering is underestimated by the model used to construct Fig. |I|. For 
these directions (such as sources in or beyond the Galactic center), the spectral wandering 
can exceed 0.1 Hz. 



3. GALACTIC ETI TRANSMITTERS 

In this section, we show that the number of ETI transmitters in the Galaxy influences 
the expected distribution of source fluxes and determines the scintillation properties of the 
sources as manifested in a particular survey. 



3.1. Demography 

Suppose the number of transmitting civilizations in the Galaxy is as estimated, 
for instance, from the Drake equation ( [Shklovskii fc Sagan 1966| ). The number density of 
sources is 

assuming that all civilizations reside in a disk of radius Rq and thickness 2Hg. 

The volume through the Galactic disk sampled by a radio telescope of beam sohd 
angle f2b is Vb = f2b-D'^/3, where D is the characteristic distance through the disk, 
D ^ mm{HQ/ sin \b\,RQ), where b is the Galactic latitude. D is direction-dependent, but 
observations in the plane will yield D ~ Rq. The number of sources in a typical beam is 

(2) 

where 6h is the one-dimensional beam width (FWHM) and we assume Rq/Hq = 150. If 
there are more than about 10^ sources in the Galaxy, every telescope beam (of 1 deg size) 
will contain at least one source, on average. It is convenient to rewrite A^b as 

(-Dmfp) ^ ^ 

where the "mean free path" for encountering a source in the beam is 

/ 3 /501\"V 150 y"/ldegV" 
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The typical distance between sources is 




(5) 



where A^d is the number of sources such that ij) equals one scale height of the Galactic disk 
(Hg): 



D 



(6) 



MgJ V 150 ; 

Analogous expressions can be derived for a spherical distribution of ET sources. The 
concentration of META candidates toward the Galactic plane motivates our focus on a disk 
population, however. 

Finally, although we assume the strong scattering regime throughout this paper, 
there may be sources within the weak or transition scattering regimes if the number of 
civilizations in the Galaxy exceeds 10^ and 10^, respectively. These numbers follow from our 
estimates in §2.1| for the distances to which weak or transition level ISS occurs and matching 
these to Eq. ^ and solving for A^d- Also, the population of Galactic ET civilizations can be 
as large as 10^ depending on what fraction of the unaccounted-for events in META are, in 
fact, due to ETl sources (HS93). 



3.2. Source Strengths of Standard Candle Transmitters 

A Galactic population of transmitters, each radiating with effective isotropic radiation 
power (EIRP) V, will show maximum and minimum fluxes, 

S, = V/Anil,, (7) 



'max 



from the nearest and furthest sources, respectively. [] Within these bounds, the pdf for S is 



fs{s) = [^]\i-{So/s,r-'rU^] . (9) 



5, 



^ For simplicity, we ignore transmitter beaming in our analysis. Beaming away from us clearly extends 
the minimum flux So to zero. The net effect, of course, is to decrease the probability of detecting strong 
sources when there is a fixed number of transmitters in the Galaxy. 
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Disk and spherical populations are described by a = 2 and a = 5/2, respectively, as is well 
known in studies of natural radio sources ( ^cheuer 1974| ; [Condon 1974| ) and 7-ray sources 
( |Wasserman 1992| ). 



Taking fmin ~ Eb, and £max ~ Rg, we find that Si/Sq ~ A^d for 
Nb < N-D ~ lO^'^. In the large civilization limit (A^d ^ N-q), the scaling is 
Si/ So ~ {RG/HcfiNB/NBf/^ ~ 10^-^{NB/NDf/^ (c.f. Eq. |). Depending on the 
abundance of transmitting civilizations in the Galaxy, the dynamic range of source fiuxes 
may be modest or extremely large. 

In reality, radio transmission powers from Galactic civilizations will be described by a 
luminosity function for V that incorporates technological and sociological evolution. Since 
we do not know the form of the luminosity function, we will make use of the "standard 
candle" form of Eq. |^ in the remainder of the paper, keeping in mind that it is essentially a 
place holder for what may be a quite different distribution of source strengths. 



4. SURVEY PROGRAMS 

In this section we summarize current and recent survey programs; see also Table |l|. 



4.1. The META Program 

Horowitz & Sagan (1993) summarize the META Survey that uses a 2^^ point Fourier 
transform spectrometer to analyze a total bandwidth of 400 kHz (0.05 Hz resolution). Two 
observing frequencies were analyzed: 1420 MHz in the "waterhole," and its harmonic at 
2840 MHz. Spectra were obtained in 20 s data spans at each sky position after adjusting 
for the Doppler shift from the Earth's motion relative to each of three reference frames: 
the local standard of rest, the Galactic barycenter, and the cosmic microwave background. 
Over a five-year period, about 6 x 10^^ combinations of spectral channels and sky positions 
were investigated. Most of these, of course, yielded amplitudes consistent with the expected 
exponential noise. After removing obvious, terrestrial interference and setting a threshold 
such that few, noise-only events would exceed it, 74 events remained. Of these, half could 
be rejected through further investigation as being due to terrestrial interference, cosmic ray 
events in the semiconductors of the spectrometer, and other processor errors. 

The remaining 37 candidates include 14 at 1420 MHz and 23 at 2840 MHz with 
approximately equal numbers of candidates in each of the three reference frames. Of these 
37 events, 26 are formally below the false-alarm threshold (a In 6 x 10^^ = 31.7cr) defined 
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so that one event should exceed the threshold in the entire survey. HS93 comment that the 
26 events below 31.7cr are consistent with the exponential distribution expected from pure 
noise. Consequently, in our analysis, we restrict our attention to the 11 events in Table 2 of 
HS93 with amplitudes > 31.7cr and especially to those 9 events larger than 33a. 

When a candidate detection occurred, the real-time data acquisition system made 
reobservations of the sky position in the relevant reference frame beginning about 40 s after 
the initial detection and ending about 3 min afterward, when the source had drifted out of 
the primary beam of the telescope (when used as a transit instrument). It is notable that 
none of the candidate positions showed evidence for a signal in any of the reobservations, 
either within minutes after the initial detection, or days, months, or years later. The 
effective detection threshold in reobservations was significantly lower than in the main 
survey observations, being typically 20 times the mean noise level. 

If all 37 META events that survive culling are due to ETI, there must be > 10^ 
transmitting sites at 1420 and 2840 MHz in the Galaxy (HS93), based on a simple scaling 
of META's "duty cycle" in scanning the sky. 



4.2. The META II Program 

The META II system uses back-end hardware identical to that of META on a 30 m 
antenna in Argentina ( |Colomb et al. 1993| ). The observation frequency is 1.42 GHz. Like 



META, META II finds extrastatistical events that are not redetected when observations 
are made at the appropriate sky positions. Reobservations are made both promptly and 
with substantial delay of a day or more. For about 25% of the time since 1993 November, 
observations have been made simultaneously with META and META II in the overlapping 
sky region of —30 deg < S < —10 deg. 



4.3. The SERENDIP III Program 



The SERENDIP HI program (|Bowyer et al. 1994D has been operated at the Arecibo 



Observatory since 1992 April in a "piggyback" mode whereby pointing of the antenna is 
under control of other investigators, thus yielding semi-random but complete coverage of 
the Arecibo sky. In this program, a 12 MHz bandpass centered at 0.43 GHz is analyzed in 
2.5 MHz wide subbands. An FFT spectrometer obtains 4M point spectra in 1.7 s at each of 
5 subbands for a channel width of 0.6 Hz. The program has covered 89% of the observable 
sky at Arecibo (28% of the entire sky) at least once and about 18% of the observable sky 



- 15 - 



has been observed at least 5 times. The threshold is 15a and no explicit reobservations are 
made. During the survey, a total of 4 x 10^^ channel-sky position observations have been 
made in 4500 hr of observations. 



5. BASIC ISSUES IN ANALYZING SURVEY EVENTS 

In the following we compare alternative explanations for the candidate events in a 
survey. For definiteness, we cast the discussion in terms of the META survey, though our 
formalism is general. We stress the fact that, in META, no signals were detected in any 
reobservations of the relevant sky positions. 

5.1. Models for Survey Events 

The models we consider are: 

Model I: The measured signal consists only of radiometer noise in both the survey and 
the reobservations. 

Model II: The measured signal results from noise combined with a real celestial signal of 
average strength 5* that has scintillated into an improbably strong amplitude, with 
gain g in the survey, but has remained below threshold in the reobservations. We 
assume that the signal would be constant were it not modulated by scintillations. 

Model III: The survey detection results from a real, transient signal of either celestial or 
terrestrial origin that does not repeat in any of the reobservations.0 Terrestrial origins 
for the survey detection would include radio frequency interference (RFI) or cosmic 
ray events and failures in the electronics. We characterize the transient signal in the 
survey as an amplitude S'rfi- 

In Fig. ^ we show simulated time series for the three models. 

Models II and III are similar in that the survey is assumed to have acquired events that 
are extremely rare, whether due to atypical forms of RFI that pass through RFI-rejection 



^ One may compare the SETI case with gamma-ray bursts (GRB), which do not repeat, but have been 
identified as extraterrestrial sources. This identification rehes on the fact that terrestriaUy produced GRBs 
that would interfere with the detection of astronomical bursts are rare, thankfully, and because many GRBs 
have been observed with more than one spacecraft, allowing time-of-arrival constraints on their directions. 
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filters or to rare scintillation peaks of real celestial sources. Surveys that sift through tens 
of teraevents are indeed likely to yield such rare occurrences. 

In later sections we shall compare, in a quantitative sense, the viability of these three 
models in interpreting candidate events and their evident nonrepeatability. While it is 
tempting to dismiss the events as a rare kind of RFI, we take the point of view that one 
should be able to show quantitatively that Model III is superior to Model II. We also 
anticipate that future surveys will yield similar events that will demand assessment and 
that some may eventually show successful detections in the reobservations. Therefore, we 
shall also explore optimal methods for conducting and reporting future SETI programs. 



5.2. General Considerations on Scintillation Amplitudes 

Some simple facts about scintillations and the demography of transmitting sites in 
the Galaxy suggest that Model II may not be dismissed so easily. In the large number 
of trials conducted in META (6 x 10^^), it is remarkable that only 37 remaining signals 
are unidentified and, for the most part, they satisfy some of the requirements expected for 
a real ETI signal: a narrowband spectral line in a rest frame of celestial interest. If we 
interpret the nonrepeatability of the candidates to be the result of scintillation modulation, 
rather than to intrinsic variability, then it must be assumed that the scintillation gain was 
extraordinarily high at the time of the detection but was at more probable levels in all 
reobservations. In order for this to be a likely scenario, the number of statistical trials must 
be large enough so as to make it probable that a large scintillation gain occurred 37 times 
during the META survey. The probability of large gains, in turn, is closely related to the 
number of signals transmitted from all ETI sites in the Galaxy. 

As before, let be the number of ETI civilizations in the Galaxy, each transmitting 
^lincs > 1 spectral lines. These lines may or may not be in bands observed by META or in 
any other survey. A small multiplicity of lines is advantageous for combatting interstellar 
scintillation ( |Uohen fc Charlton 1995| ; Pordes fc Sullivan 1995| ). If no "magic" frequencies 



or reference frames are believed to pertain to the choice of frequency, the best we can do a 
priori is specify a (large) spectral domain of bandwidth Bd for the possible frequencies of 
transmission. The total number of "scintillation trials" for a full sky survey is 

A^trial.,ISS ~ AoAiinesi^sP., (10) 

for Kg trials per sky position and where P,^ is the probability that the observed frequency 
band in a survey does, in fact, contain signals from ETI sources when a telescope is pointed 
toward a given source. In the optimistic case, where the choice of frequency bands is 
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believed to be the proper one and that one or more of the transmitted hnes for each and 
every civihzation falls into one of the chosen bands, 

Pu = 1. (11) 

However, if all frequencies in a large spectral domain Bd ^ Az/spectrometer are equally 
probable, then 

j-j ^ -^i^ -^frames Az/spectrometer /"I 0^ 

Let Pg(5'max) = exp(— (yfjnax) bc the probability that a scintillation gain (^max, or larger, 
occurs. Combined with the number of scintillation trials, we constrain values of (7max using 



-^trialsJSS-Pglfi'max) ~ Ncand (13) 

(14) 



or 

S-max ~ m 

cand 

Using values for META from the previous section 

r^^ax ~ In iO.llN^NuncsKM • (15) 

The meaning of Eq. ^ is that if one wants to invoke likely values for scintillation gains 
to account for candidate events, the gains must not strongly exceed a well-defined value, 
namely (^max- Given that there must be at least as many transmitters as there are 
candidates, A^d > A'cand, it may be concluded that g^^.^ can range from a value of order 
unity if there are few signals transmitted from sources in the Galaxy (in bands that have 
been observed) to a value that is easily in the range of 10 to 20 if ETIs abound in the 
Galaxy [Nd ^ 1) or are profligate in their use of the radio spectrum (Aiines ^ 1)- 

If dmax is large, the intrinsic source strength can be proportionately smaller. For 
candidate events with intensities /cand/ {N) ~ 32-40, say, the intrinsic source strength 
(i.e., in the absence of ISS) need be only /cand/(5'max(A^)), which could be just a few if 
5'max ~ 10 to 20. Since the survey threshold in META was ~ 32{N) and the reobservation 
threshold was ~ 20{N), the implication is that common values of the scintillation gain 
will render the signal undetectable, even with a large number of reobservations. For 
example, in 100 reobservations of a specific source, the maximum likely scintillation gain is 
In 100 = 4.6, yielding signal amplitudes gS <^ fi'/cand/fl'max ^ 15(A^), too small to be detected 
in reobservations. 
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5.3. General Considerations on Noise Amplitudes 

Compared to the number of independent ISS tests, an enormous number of independent 
trials is made with respect to noise. This number is 

-^trials, noise ~ 

where Nsi^y is the number of independent sky positions searched. As stated before, 
Atriais,noise ~ 6 X 10^^ for META. For the simplest spectral analysis of an unsmoothed FFT 
spectral estimate, the noise in the spectrum is distributed exponentially (c.f. Eq. with 
S = 0) and the false-alarm probability is specified so that Atriais,noise-Pfa ~ 1- For META, the 
corresponding threshold is about 32(A). This implies that much more modest amplitudes, 
say 16(A), occur many times (~ ^ Atriais.noisc ~ lO'^ for META) during the survey. We will 
find in our analysis below that these modestly frequent noise spikes, combined with rare 
scintillation gains, play a key role in determining the likelihood function for signals from 
ETl sources. 



6. LIKELIHOOD FUNCTIONS OF META-LIKE SURVEYS 

To quantify a survey, we define Ag^y as the number of independent sky positions (or 
telescope beam areas) considered; A^h the number of independent frequency channels; and 
Ks the number of independent observations or trials made per sky position. Consider a 
survey that yields Acand candidate events having amplitudes /*^™^(j) above a threshold 
The results of the survey may be described as the set of candidate signal intensities 

C={/(^")(j)>4'"\j = l,...,iVcand} (17) 

and the non-detections 

A- = {/(^'^)(J) < = 1, . . . , As, - A,a,d} , (18) 

where Agu = Atriais,noise- Further, assume that the direction for each candidate survey event 
is reobserved Kj. times and that no signal is found above a threshold intensity yielding 
the set of non-detections during reobservations, 

n = {k,{j) X Ai;;°) amplitudes < 4™), j = 1, . . . , Ae^nd} • (19) 

Because the total number of statistical tests in reobservations is much smaller than in the 
survey, the reobservation threshold may be significantly smaller than in the survey. For 
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simplicity in our discussion, we assume the threshold of all reobservations to be the same, 
but this need not be the case. It is a simple matter to account for thresholds that vary due 
to use of different dwell times or different background noise levels. 

We distinguish reobservations where the scintillation gain is correlated with the gain, 
Qci during the original candidate detection from those where it was uncorrelated. In 
practice, without explicit detection of an ETI signal over a sustained time, one does not 
know how long the gain is correlated because it depends on the strength of scattering to 
the particular source. Nonetheless, statistical inference from a set of observations (viz. an 
initial detection with only upper bounds in reobservations, as in META) depends on the 
degree of correlation assumed. We will make the simplifying assumption that reobservations 
have either completely correlated or completely uncorrelated gains. Consequently, we write 
the number of reobservations as the sum of correlated and uncorrelated numbers, 

i^r(j)=^r,c(j)+^r,u(j)- (20) 



The likelihood function for a given, individual candidate event is 

£ _ £(su)£(ro) 



(21) 



where C^^^'^ is proportional to the probability density evaluated at the event's survey 
amplitude I*^^^^ and iZ^™) is the probability that no detection was made in reobservations. 
For the three models presented in § p.l| , the survey likelihoods are (dropping the j subscript 
for clarity) 



CI 



(su) 



su) 



^(su) 
^(su) 

''-III 



9cS 
Skpi 



(22) 



where fi is defined in § p.3| and Appendix 0; A/ ^ J*^^") is the uncertainty in establishing 
the survey amplitudes and Qc is the scintillation gain at the time of the candidate detection. 
The approximate equalities in Eq. 



result because the exact form would be integrals over 
intensity of /i over intervals AJ centered on the candidate intensity. The reobservation 
likelihoods are the probabilities of no detections in reobservation trials: 

Kr 
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Detection probabilities were introduced in §p.3| and Appendix M. 



For Model II, we have written separate, reobservation likelihood functions for correlated 
and uncorrelated scintillation gains. It is useful to normalize intensities, thresholds, and 
signal strengths by the mean noise level {N), e.g.. 



T) = I/{N) 
C = S/{N). 



(25) 



The likelihood functions become 
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(26) 



where /q is the Bessel function and we have dropped a (constant) factor (A J/ (A^)) on the 
right-hand side of each equation. 



7. INDIVIDUAL CANDIDATE EVENTS WITH NO CORRELATED 

REOBSERVATIONS 

In the surveys summarized earlier, unexplained, extrastatistical events are found in a 
few directions that, when reobserved, yield no evidence for persistent signals. 

Here, we consider multiple reobservations of a given sky position (i.e., an individual 
candidate source) that are completely statistically independent. For the case of scintillating 
sources, this means that reobservations are made more than one scintillation time after an 
initial detection. While the META survey did, in fact, make "prompt" reobservations only 
40 s after initial candidates were found, it is useful,for pedagogical reasons and for purposes 
of generality, to analyze events as if they were statistically independent. It is possible, 
for example, that the reobservations are, in fact, statistically independent if ETI sources 
are rare and are at large distances, thus producing short scintillation times (i.e., ^ 40 s). 
Moreover, we show in 5BI that correlated reobservations combined with uncorrelated ones do 



not change substantially the conclusions we find here. In §^]2| we apply a likelihood analysis 
to an entire survey, which allows constraints to be made on the overall population of ETI 
transmitters that may exist in the Galaxy. 



- 21 - 



7.1. Comparison of Likelihood Functions for Individual Events 

Some initial conclusions may be made from direct inspection of the likelihood functions. 
Firstly, given that survey events necessarily have large amplitudes (above the false-alarm 
threshold and, thus, well above the mean noise level (N)), the noise-only Model I has a low 
likelihood compared to the other two models. This becomes clear for events rj > tjt ^ 1 
when we approximate Io{x) ~ e^/ y/27rx and calculate the ratio (dropping the 'RFI' 
subscript on (): 

Cui exp(2v/r/C - C) 



Ci ' (47rV^)i/2 ■ ^^^^ 

For signal strengths 1 < C ^ and candidate strengths rj > 32, we have Cm/Cu > 10^. 
Similarly, Model II can be shown to be far more likely than Model I for many parameter 
values. In general, for r] ~ (^pi ~ gcC 3> 1 and 3> 1 we find that Models II and III are 
much more strongly preferred than Model I. For sufficiently small scintillation gains, Qc, 
however, ( can be large enough that the reobservation factor C^i°^ /Cj^"^ will make Model II 
less likely than even Model I. 

Secondly, if we hypothesize, in Models II and III, that a strong source (real or RFI) 
underlies the event and that, whatever its nature, the amplitude was the same in either 
model (i.e., gcC = Crfi), then ^jj"-* = £in\ but the net likelihood ratio is (assuming only 
uncorrelated reobservations for model II) 



l_e-^''T°V(C+i)l'^''" 

<1. (28) 



(ro) 

1 - e-^T 



The larger any real signal of average, continuous strength is in Model II, the smaller the 
likelihood ratio in Eq. ^ and the more that Model II is disfavored. However, if the survey 
detection was made when was far out on the tail of the exponential pdf, then can be 
sufficiently small that Cu/ Cm — > 1. 

For a specified lower limit on r, e.g., r > 0.5, one obtains a likely upper bound on C, 

(ro) 

C < Cmax = ; ^ - 1. (29) 



In 



(ro) 



For i^r^u = 256 (typical of META) and r > 0.5, we obtain C, < 2.38 using a reobservation 
threshold of rj!^"^ = 20. For a typical candidate signal strength, rj = 32, a scintillation 
gain Qc = v/C ^ 13.4 is required in the survey. The probability of this occurring is small, 
Pg(13.4) = e~^^'^ = 10"^'^, but it is large enough that in a many-trial survey, such a gain is 
likely to be encountered if A^triais, iss is large, as discussed in ^57^ 
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Therefore, with measurements such as those outhned in §|], it is impossible to favor 
Model II over Model III. While Pg(13.4) = 10~^'^ is not large, given that we do not know 
the number of transmitters in the Galaxy and what (7max niight have been in the survey, 
we also cannot favor Model III over Model II. Statistically, over a set of events, however, 
it may be possible to conclude that an unlikely ensemble of scintillation gains is required 
to account for the known events. Also, the likelihood does not automatically account for 
the probability that particular scintillation gains will occur. For this reason, we extend our 
analysis in the next section to include Bayesian factors. 



7.2. Bayesian Analysis of Individual Events 

In this section we adopt a Bayesian analysis for the a posteriori joint pdf of Qc and C, for 
a particular candidate event. The Bayesian analysis proceeds by deriving a posteriori pdf's 
for the unknown parameters QcX^ Crfi by taking the product of the appropriate likelihood 
function (for a given model) with prior pdfs for the parameters and normalizing by the 
integral of the product over the parameter space. The advantage of this analysis is that it 
penalizes improbably large scintillation gains and large source fluxes. 

The a posteriori pdf for a vector of parameters is, in terms of the survey detection 
with amplitude rjc and nondetections in reobservations, 

where C represents one of the likelihood functions defined previously^; fg^O) is the joint a 
priori pdf for the parameters 0] X represents background information; and the denominator 

/(r/,|x) = j de f0{e)c{v,\ei) (31) 

normalizes the pdf. 

Model I for candidate events (noise only) has no parameters other than the background 
noise level, which we assume to be known. 



Following standard practice, we explicitly show the background information as an element of the analysis. 
However, nonstandardly, we use "pdf" notation for the posterior and prior pdfs for the parameters rather 
than using the common, but misleading, p notation. We also write the likelihood function with arguments 
that often are written only when the identical quantity is referred to as the sampling distribution (e.g., 
Gregory fc Loredo 199^ ). 
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In Model II, the two parameters to be determined for each event are the scintillation 
gain, and the intrinsic (average) signal strength, (. We assume gc and ( are statistically 
independent in the joint prior pdf, which therefore factors into the individual pdf 's. The 
prior pdf for gc is fg{gc) = exp(— (/c), appropriate for the strong scattering regime, which 
we assume. For ( (i.e., S) we adopt a pdf that corresponds to the power-law distribution 
of sources considered in §^]2| (cf. Eq. |). We assume a Galactic disk population of sources 
(a = 2) and take the range of source strengths to be large, Ci ^ Co {or Si ^ Sq). The prior 
pdf for C is then 

/c(c)^Cor'f/(c-Co)f/(Ci-c) (32) 

where we have used Heaviside functions U to define the cutoffs of the power-law pdf. 
The posterior pdf for Model II then becomes 

r f ^1 T-N /g(^c)/c(C)Ai(^c|^C^) 

f{Vc\I) 

For the assumed prior pdfs (and dropping the Heaviside functions for clarity), we have 

where the proportionality constant is Co//(^c|^)- Strictly, the cutoffs Co,i of the source 
strength distribution are also parameters to be estimated or constrained and they could be 
included explicitly in the posterior pdf, necessitating inclusion of corresponding prior pdfs 
on the right hand side of Eq. ^3H3^. For now, however, we treat these as hidden parameters, 
assuming that the range of source strengths, Co < C ^ Ci; is large enough to include the 
best fit values of gc and C- 

For Model III, which considers candidate events in the survey to arise from events 
that are intrinsically transient, we hypothesize a distribution of RFI amplitudes, /rfi(Crfi), 
that has parameters to be determined via application of Bayes' theorem. As an 
illustration, we consider the simple example of a pdf that is constant in the interval 

Co,RFI = < CrFI < Cl.RFI' 

/rfi(Crfi) = -^?7(Crfi)?7(Ci,rfi - Crfi). (35) 

U.RFI 

With the exception of multiplying constants, the resulting posteriori distribution does not 
differ from Cm. 



(34) 
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7.2.1. Application to META 



Of the candidates reported by HS93, the majority of the extrastatistical events are near 
rjc = 32 with a few at larger rjc. In Fig. |^ we show contours of the posterior pdf, Eq. for 
four cases: rjc = 32 and 100 with thresholds rj!^"^ = 10 and 20; in all cases we assume that 
i^Tr^u = 256 reobservations were attempted. 



Table 2 displays the values of Qc and ( that maximize the posterior pdf for the specific 
rjc and rj^"^ we have considered here. In general, we find that as rj^ increases (for constant 
r]^"^), the peak value of the pdf decreases, the region of constant probability becomes more 
concentrated about the peak, and the peak moves to larger gain and slightly larger signal 
strength. 

A significant outcome of this analysis is that, at the time of the initial detection, 
the apparent source strength, gcC, is only about 50% of the measured candidate strength. 
The implication is that, even at these large intensities, the noise contributes significantly 
and sources with intrinsic intensities well below threshold will most likely be seen as the 
combination of a large scintillation gain and a large noise spike. Although large, the 
scintillation gains derived here are considerably less than those derived in our earlier 
discussion of the likelihood functions for Models II and III (cf. §7J.) where an observed 
intensity of rjc = 32 implied a scintillation gain Qc ^ 13.4. These smaller scintillation gains 
are considerably more probable, P{g > 6.2) = e~^'^ = 10"^ ''. 



Figure ^ shows the best fit values for Qc, C, and QcC as a function of the number of 
uncorrelated reobservations, i^r.u- We show results for three values of the reobservation 
threshold, rj^"^ = 5, 15, and 20. In all cases, the product QcC is less than the apparent 
source strength at the time of the detection. This figure demonstrates two effects: (1) that 
noise in general plays a key role in producing candidate events in a large scale survey; and 
(2) even a large number of reobservations fails to constrain the source flux to be very small 
unless the reobservation threshold rj^"^ ~ 5, much less than that used in META. 

Our conclusions may be understood by referring to the various factors in Eq. For 
event amphtudes (rjc) of interest, the argument of the Bessel function is large, 2^J g^Crjc ^ 1- 
Using the large- argument approximation for Jq, we may write the posterior pdf for and C, 

as 



/ii(^c,CI^2:) oc 



-9c 



Vc-\/gc( 



(^c^cC) 



-1/4 



e-^^™V(i+C) 



(36) 



The third and fourth bracketed factors in Eq. ^ represent the likelihood function Cu 
and, by themselves, are maximized for g^C ~ rj^ and C 1, respectively. Maximizing the 
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likelihood function therefore yields an apparent source flux gcC that is nearly equal to the 
observed candidate event t]c but favors actual source strengths ( that are small; hence, the 
ML value for is large. The ML estimate for is unrealistic because it is unlikely to have 
occurred in the number of trials considered, unless ETI is hyperabundant in the Galaxy. 
The Bayesian estimation procedure is far more realistic because it builds in penalties for 
both large scintillation gains and strong sources via the first two factors in Eq. ^ These 
factors favor, respectively, small gc and small (. We then see that it is through these 
two factors that the Bayesian estimates yield apparent source fluxes that are considerably 
smaller than the measured event amplitude. 

The a posteriori pdf also displays, for small ri^"\ an important bimodality, as seen 
in Fig. ^c. One mode occurs at "large" g^. and (, gc > 1 and ( > 1, the other at the 
origin of the contour plot. The bimodality occurs once the number of reobservations Kj-^^ 
becomes large enough that the fourth factor in Eq. 36 compensates the third factor. For 
still larger -fTr.u, the reobservation factor dominates and only the maximum near the plot 
origin remains. This "collapse" of the posterior pdf is a consequence of the particular 
source strength distribution, fs{S), we have adopted, namely a disk population of standard 
candles, which favors C once the fourth factor dominates the third factor. In practice, 
we search for the maximum of fii{gc,C\v^) with a minimum ( > 0; for Fig. |^, we have 
searched ( > 0.1. 



7.3. A Critical Number of Reobservations 

The bimodal form of the posterior pdf in Eq. may be used to derive the number 
of reobservations needed to rule out Model II, where the source strength, (, is assumed 
constant in time. For META parameters, we have seen in the previous section that a small 
number of reobservations influences the parameter estimates for g^. and ( but they do not 
rule out Model II. Since, in the limit ( ^ 0, Model II becomes identical to Model I, our 
approach will be to estimate the number of reobservations, K^,u, required to make C ^ 1 
the most probably value for (. 

The logarithm of the posterior pdf, Eq. 0, is 

An = -(7e - 2 In C + In /,(r/c, 9cC) + ^r,u In \l - Pd,scint (v?\ C)] ■ (37) 



For the case where there are no reobservations (-ft'r.u = 0), An is maximized for (^c = 2 and 
( = ric/2 — 9/4. The difference between max(An) and its value for small signal strength, 
C ^ 1, is ~ ?7c. For reobservations to affect the location of the maximum significantly, the 
last term in Eq. ^7\, which favors small (, must alter the slope dXn/dC, so that it becomes 
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negative for all C 1- For such negative slopes, there cannot be a "large" ( maximum 
in An that would imply a real signal was present. By maximizing An with respect to Qc 
and requiring a negative slope at ( = 1, we obtain the number of reobservations needed 
to rule out a detectable signal. The result is Kj-^u > [-^r,u]crit where the critical number of 
reservations is, approximately, for rjc ^ 1, 



[-^r,u]crit ~ 



o 



(ro) 

Vt 



In Fig. 1^ we show [iTr.Jcrit plotted against the reobservation threshold 77^°'' for several 
values of candidate strength rjc- The figure demonstrates that large thresholds produce 
exponentially large numbers of reobservations required to rule out Model II. For META, 
where a reobservation threshold 77^°'' ~ 20 was used, [-ft"r,u]crit ~ 10^'^ for candidate events 
just above the survey threshold, r^^^"^ = 31.7. Stronger candidate events require a greater 
number of reobservations. 

We conclude that candidate events in META cannot be ruled out as being due to 
scintillating sources, even if they transmit with constant power aimed into our direction, 
because there were too few reobservations and the threshold was too high. 

We also conclude that constraining reobservations must be made at lower thresholds 
than have been used (cf. Fig. However, a lower reobservation threshold yields a larger 
number of false alarms. The false-alarm probability in reobservations is exp — ?7x°'' • The 
number of false alarms expected in the critical number of reobservations (per frequency 
channel) is 



AA^fa = [K^A^r^^Pt^ = (1 - ) (r^, - 9). (39) 

Figure |^ also shows AAfa plotted against reobservation threshold. The dashed line shows 
where one false alarm is expected per frequency channel. For rjc = 32 and 77^°^ — 5, 
AAfa ~ 0.38. While this number is small on a per-channel basis, when combined with the 
fact that reobservations need to search in frequency space to take into account unknown 
doppler shifts, it suggests that many false alarms are to be expected in reobservations made 
at small, otherwise constraining thresholds. 
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8. INDIVIDUAL CANDIDATE EVENTS WITH CORRELATED 

REOBSERVATIONS 



In §|7.2| the Bayesian estimates for scintillation gain and source strength ( for 



individual events were made while assuming that the scintillation gain had decorrelated 
completely at the reobservation times. This corresponds to reobservations that are made 
hours or perhaps even just minutes after an initial detection. In META, however, a few 
reobservations were made within 40 s of the initial detection, a time short enough that 
the scintillations could have been highly, if not perfectly, correlated between the first few 
reobservations and candidate detection. In this section we consider such prompt, correlated 
reobservations in the Bayesian estimate of signal parameters in the context of Model II. 

We assume, as before, that a candidate signal with amplitude, rjc, is reobserved 
in u unsuccessful trials where the scintillations are uncorrelated. In addition, we 
include K^^c reobservations, also unsuccessful, where the scintillations are correlated over 
the time spanning the reobservations and the initial detection. To illustrate the effects 
of such correlated reobservations, we assume they are 100% correlated. In reality, the 
degree of correlation would vary slowly from 100% to zero correlation. But the point 
here is to demonstrate the stronger influence of perfectly correlated reobservations. We 
find, in fact, that the Bayesian estimates for and ( are quantitatively different when 
correlated observations are included. However, the same conclusion is reached as before: 
the candidate events occur (if Model II is assumed) because noise and signal conspire to 
produce a threshold crossing in the survey. Moreover, though the number of uncorrelated 
reobservations needed to rule out Model II decreases when one or more correlated 
reobservations is made, the required number is still rather large unless smaller-than-used 
thresholds are used in the reobservations. 



Unsuccessful, correlated observations introduce a new factor in the likelihood function 
and a posteriori pdf (cf. Eq. |33H3^, 1 — Pd(-^T ^fi'cC) or, equivalently, a term in the 



log pdf, 

X[r^ = K,^Jn[l-P,{lt\gX)]. (40) 

This term favors small values of gcC- When included with the other factors in the posterior 
pdf, it pushes estimates for ^fcC to values that are lower than if there were no correlated 
observations. The influence of correlated reobservations is illustrated in Fig. |^ (cf. Fig. |^), 
where estimates for gc, C, and gcC are plotted against K^^u, for several values of K^- c, with 
thresholds 77^°'' = 20, 10, and 5. The effects of the Ajj'^°'* term are to decrease the best 
fit C as Kj- c increases. Also, the critical number of uncorrelated reobservations needed to 
favor C ^ 1 decreases as Kj- c increases. For the larger reobservation thresholds, 77^°'' = 20 
and 10, the necessary number of reobservations still measures in the hundreds, even for 
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K^c = 16 correlated reobservations. With the smaller threshold, only a few (but more than 
one) correlated reobservations cause the best fit C to collapse below unity. 

In Fig. analogous to Fig. ^ we show the number of correlated reobservations needed 
to rule out Model II. Not surprisingly, this number is substantially smaller than the number 
of correlated reobservations. In practice, of course, the scintillation time scale limits 
how many correlated observations may be done. A general expression for the number of 
uncorrelated and correlated reobservations needed to rule out Model II is 



^£- + P."-,^,_/,>(^). (41) 



We conclude that: 



1. correlated but unsuccessful reobservations reduce estimates for and ( from those 
seen in Fig. |^ where only uncorrelated reobservations were assumed; 

2. the occurrence of unsuccessful, correlated reobservations does not obviate the 
conclusion that initial survey detections result from a noise-scintillation conspiracy 
where both noise and signal are needed to provide the survey detection; and 

3. the noise-scintillation conspiracy makes it unlikely to detect the signal in a small 
number of reobservations. 



9. GLOBAL ANALYSIS ON ENTIRE SURVEYS 

In this section we return to the question we raised in §7. 1|: Although radiometer noise 



alone cannot explain the extrastatistical events seen in META and similar surveys, how 
likely is the scintillation interpretation? Restated, is the required ensemble of scintillation 
gains consistent with the gain pdf and with a plausible distribution of intrinsic source 
strengths? 



9.1. Survey Intensity Distribution 

As in we take Ai'cand to be the number of candidate signals, i.e., those intensities 
above threshold, in the survey with the signal intensities in the set C (cf. Eq. |l^. These 
intensities are archived while the remaining Ngu — Nc^nd ^ Ai'cand observations are discarded, 
so that the only information for these observations is that the intensity was less than 
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Now suppose that some fraction esky of the beam areas covered in the survey 
contain transmitting civihzations whose received signal strengths S {before accounting for 
scintillation modulations) are described by a pdf, fs{S), and that some fraction ech of the 
frequency channels surveyed contain power from broadcasting civilizations. In terms of 
previously defined quantities, we may write 

esky = l-(e-^^), (42) 

where iVb is the average number of sources expected per beam (c.f. Eq. |^) and the 
exponential is averaged over all directions. The spectral fraction is 

-^signals/beam'^^spectrometer -^signals/beam /A'3\ 

where iVsignais/beam = A^iines ( A^b ) , with (A^b) being the number of sources in a telescope beam, 
averaged over all directions. The total fraction of sources contained in the A^su survey 
"cells" is given by e^^ = eskyCch- 



The fraction esky can be related to the number density of sources defined in §^ri| As 
we show in that section, even a small number of transmitting civilizations, ~ 10^, can result 
in more than one source per beam so that esky need not be small and may in fact approach 
unity. However, it is assumed that transmitting civilizations will use narrow bandwidths, 
<^ 1 Hz, in a small number of individual channels. Since the center frequencies of such 
narrowband signals are not known a priori, it is obviously necessary to cover as large a 
range in frequency as possible in the survey, necessitating that the number of channels, A"ch, 
be quite large (e.g., in META, A'ch ~ 10^). Consequently, we expect esu ^ ech ^ 1. 

Within the context of the three models presented in §§^|-§|, the sampling pdf for the 
candidate intensities is 

/f)(/) = (l-es,-eRFi)/,(/,5 = 0) 

'rf5/5(S)/,,scint(/,^) 

+ eRFi/rf5/RFi(5RFi)//(/,5RFi). (44) 

Here the first term accounts for observation cells devoid of sources, the second term accounts 
for those cells with scintillating sources, and the third term accounts for transient sources. 
Analogous to esu, we have introduced eRpi, the fraction of observation cells in which there is 
a transient signal. Just clS 6su CcLll be related to the number of transmitters in the Galaxy, 
the rate at which transient signals occur and their duration can be related to eRpi. 

Following §^ we use likelihood functions involving fj^^^ to test various models. Our 
goal is to test the hypothesis that a population of scintillating sources can account for some 
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of the events seen in SETI programs and to estimate parameters of any such population. 
Accordingly, we shall consider the same models as in a population of scintillating 
sources, Model II, for which e^Fi = 0, and an RFI-only explanation for the events. Model III, 
for which egu = 0. However, should there be a method of distinguishing between terrestrial 
RFI and narrowband celestial transients in the future, our analysis can easily be extended 
to include such possibilities. 

We shall also need the detection probability for the survey. Integrating Eq. we have 

Pt^ = (1 - e,^)Pt^ + / dS /5(S)Pd,scint S) , (45) 

for Model II; for Model III, a similar expression holds with the substitution of eRpi for egu 
and Pd{I^^\ S) for -Pd,scint(-^T"'*; '^)- Here, Pf^^^ refers to the false-alarm probabihty for the 
survey as a whole, hence, Ng^P^^^^ ~ 1. We expect 

(iVcti) = N^uPt^ (46) 

to be the expected number of candidate signals found in the survey. If there is noise only, 
amplitudes will follow the exponential pdf that results from Eq. El with S = 0. 



In practice, the noise level (A^) is not constant over all sky positions, due to ground 
spillover and Galactic latitude dependent backgrounds. A survey may be divided into 
regions where (A^) is constant and each region can be analyzed individually. Here, however, 
since our goal is to exemplify a method of inference, we assume (A^) to be constant. 



9.2. Likelihood Function for the Entire Survey 

The likelihood function for the survey is 

where the factor for survey candidates is 

-^cand 

/:cand = n ft\lj) (48) 

and the non-detection factor is 

^non = n P(/</r) = [l-^f^ (49) 
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For our assumed form of /) , the parameters of interest are the fraction of survey cells 
containing sources or RFI, egu and eRpi, and the parameters of the source and RFI strength 
distributions, fs{S) and /rfi('S'). 

As in our analysis of individual events in §^ we again assume that reobservations occur 
at times such that the scintillation gain is uncorrelated between all survey and reobservation 
measurement S0. With K,.{j) reobservations per candidate, the likelihood function for the 
reobservations is 

^^"^ = n n ^ < 4 ) = 
j=i k=i 

Equivalent to the survey detection probability in Eq. ^ is a reobservation detection 
probability 

Pt^ = (1 - e,o)Pfi™^ + / dS /5(^)Pd,scint S) (51) 

where we have distinguished, through the label "ro," the false-alarm probability, the 
source-detection probability, the threshold, and the fraction of cells containing sources 
in the reobservations from those conducted in the survey observations. Reobservations 
have a lower threshold intensity, and, accordingly, larger false-alarm and detection 
probabilities. The beam fraction, e^^^, is also different from that in the survey because 
directions are selected in the survey that have shown strong signals. If there is a population 
of sources we would expect e^^^^ 1, but we still expect e^™"* 1 ero ^ ^ch'''- 
instance, to cover a bandwidth equivalent to the Doppler shift resulting from the Earth's 
motion, 10~^c, at an observing frequency of 1 GHz requires 10^ channels of 1 Hz each. In 
this example, therefore, e^™'* ~ 10"^. 

In our earlier analysis of the individual events, §^ we assumed that S'rfi = during 
reobservations. In contrast, since we are now analyzing the entire survey, we must allow for 
the possibility of RFI to be present during the reobservations. Consequently, in considering 
reobservations in Model HI, we use Eq. ([5l| ) with the substitution (^It"\ '^^ place of 

p f rM q\ 

d,scint 1 -^T 

The complete likelihood function is 

£ = C^''''^&°\ (52) 
In practice, we analyze the log likelihood, 

A = log £ = Acand + Anon + A^. (53) 



1-p. 



(ro) 



(50) 



^ Assumption of uncorrelated vs. correlated reobservations is not critical; reobservations of survey 
candidates are dominated by the vastly greater number of nondetections in the survey itself. 
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Parameter values may be estimated through maximization of A. Alternatively, Bayes' 
theorem may be applied by multiplying £ by a prior probability density for the parameters 
and normalizing to obtain the posterior pdf for the parameters. In the following we analyze 
only the likelihood function. This choice is made because (1) the pdfs for scintillation gain 
and source strength are already built into the likelihood function; and (2) any prior pdfs for 
the relevant parameters, (i, (q, egu, and ero, would be so broad that they would not change 
the net results. 



9.3. Fitting Model II to META 

In Fig. |l^a and b we show the likelihood functions for Model II at 1420 and 2840 MHz, 
respectively, using the META results (HS93). Table 3 reports the location and amplitude 
of the maximum in the likelihood function for both the survey and the survey plus 
reobservations of candidates. In column 1 is the radio frequency, column 2 is the maximum 
amplitude of the logarithm of the likelihood function, and columns 3 and 4 are the maximum 
likelihood values for CoCsu and Ci, respectively. We do not present the results for Acand or 
Anon because there exists no single maximum for these functions, but rather a region of 
constant and maximum likelihood. 

There are several aspects of these results which deserve explanation. Horowitz & 
Sagan (1993) report 14 candidates at 1420 MHz and 23 candidates at 2840 MHz, all with 
intensities greater than 28 (N). However, they also acknowledge that they have included 
candidates just below threshold so as not to exclude potential sources. In producing 
these likelihood functions, we have used only those 11 candidates above the proper survey 
threshold, rj!^^^ = 31.7. We have also assumed that all reobservations were conducted with 
a threshold of 1]!^°^ = 20 (Pfa = 10~^'^) and that each candidate was reobserved Kj = 256 
times. In order to continue plotting the likelihood functions as functions of only two 
parameters, we have set the ratio, = Cro/esu, to be constant and equal to unity. Finally, 
the likelihood function for Model II is plotted as a function of Co^su and (i. The degeneracy 



between the parameters Co and esu arises because fs{S) oc Co, cf. Eqs. and^. In the 

limit Co ^ 1; the quantities fj^^\l), P^^^\ and P^°^ become approximately linear in the 
quantity Coesu- 

There is no difference in either the magnitude or location of the peak likelihood when 
comparing Asu and Asu+ro- The lack of the reobservations' influence arises because of the 
vastly larger number of nondetections in the original survey than reobservations. For the 
non-detections, 



Anon ~ iVs, log 1 - Pt\v^T''\ ^su, Cl) ~ -N^uPP , M 



,(su) 
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where we have used the expansion log(l — x) ~ —x. A similar expression holds for Afo, 
Aro ~ — A^'cand-^r-Pd™'*- These two likelihood functions have a similar shape in the (Coesu)Ci) 
plane. However, because N^^ 10^^ while A'candA'r ~ 10^, the nondetection likelihood 
completely dominates the influence of the reobservation likelihood. 

Thus, Asu+ro is dominated by Acand and Anon- Acand increases (and Anon decreases) if 
either Co^su or Ci (o^' both) increases, as a result of the larger number of sources that can be 
detected. When these two are combined to produce Agu+ro, the maximum of the likelihood 
function occurs at values for Co^sn and Ci that are intermediate to those which respectively 
maximize Acand and Anon- As Si is the upper cutoff to the source strength pdf, a simple 
constraint on C,i is that it cannot be too much smaller than the intensity of the strongest 
candidate observed^. Similarly, given the small number of candidates (i.e., a few tens out 
of ~ 6 X 10^^ total trials), we would expect that esu is small. The contours extending to 
very large Ci arise, in part, because of the exponential pdf for the scintillation gain. Since 
the most probable gain is (7 = 0, very large Ci can be tolerated and the absence of sources 
with C ~ in the survey is explained by assuming that scintillations modulated any such 
sources below threshold. 

The locations of the peaks at the two frequencies compare favorably. The location of 
the peak likehhood at 1420 MHz is within 10% of the peak likelihood at 2840 MHz and 
the 2840 MHz peak hkelihood is within 50% of the 1420 MHz peak likehhood. The lack of 
better agreement can be attributed to the fact that there were twice as many 2840 MHz 
candidates as 1420 MHz candidates and the strongest 2840 MHz candidate had 77 = 746.6 
while the strongest 1420 MHz candidate had 77 = 224. 

The difference in strongest candidate signals between the two frequencies would 
seemingly explain the difference in the amplitude of the likelihood functions as well. It does 
not. At 1420 MHz, there are four candidates, three with ?a 34 and one with rj = 224. 
At 2840 MHz, there are seven candidates, three with r] fa 32, one at 77 = 746.6, and three 
with 40 < 1] < 80. It is these last three candidates which cause the marked difference 
in the maximum value of the likelihood functions, simply because there are so many of 
them. An occasional large scintillation gain can combine with a large noise fluctuation (cf. 



§ |7.2.1| ) to produce a signal well above threshold. Hence, the likelihood function is somewhat 
insensitive to one candidate with an observed intensity well above threshold. However, 
obtaining several, independent combinations of such gains and noise fluctuations becomes 
increasingly, and rapidly, less likely. 



^ Ci can be smaller than the intensity of an observed candidate by virtue of upwards noise and scintillation 
fluctuations. 
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9.4. Fitting Model III to META 

The above discussion implicitly assumes that Model II (a population of scintillating 
sources) is the correct model. To compare Model II with Model III (the RFI model), we 
have also constructed the likelihood functions for this model, Fig. |lO|c and d. The maxima 
and locations of the likelihood functions are tabulated in Table 3. 

In estimating these likelihood functions, we have taken fnpi to be a fiat function, as 
in §7.2| , with an upper limit S'l rfi and a lower limit S'o,rfi = 0. As for Model II, we have 
assumed that each candidate position was reobserved 256 times and that is constant and 
equal to unity. 

The likelihood functions for Model III are very similar to those of Model II. Again 
Asu+ro ~ because of the vastly larger number of survey observations; Acand increases 
with increasing eRpi and/or Ci,rfi while Anon decreases, with the combination of these two 
producing a peak in Agu. 

There are two notable differences between the likelihood functions for the two models, 
both arising from the assumed /rfi- Since the pdf is fiat, all intensities are equally likely, in 
contrast to Model II, where it is possible to "hide" very large intensities with a very small 
scintillation gain. Consequently, a smaller e^Fi is required for Model III than Model II in 
order that there not be too many candidates. Similarly, the peak likelihood tends to be 
more concentrated about the most likely Ci,rfi; rather than extending to very large values 
as for Model II. 

The key issue between these two models, though, is the value of the likelihood functions 
at the peak. At both frequencies, we find that the maximum likelihoods differ only slightly, 
with the RFI model slightly preferred at 1420 MHz and the scintillating source model 
slightly preferred at 2840 MHz. We conclude that we are unable to favor either Model 11 or 
Model III. 



9.5. Interpretation of Model II Fitting Results 

The fitting results yield estimates for ^o^su = Co^skyech and Ci, the normalized upper 
cutoff of the source flux pdf. Using results from §^ and Eq. (^)-(|43|), we may relate these 
estimated quantities to the population parameter Ad, the number of sources in the Galaxy. 
Figure |Tl] shows egky as a function of the mean free path (c.f. Eqs. and ||3^) for a simple 
disk model for the Galaxy. We have assumed a circular disk of radius Rq = 15 kpc and 
thickness 2Hg = 0.2 kpc with the Sun at the disk center (heliocentricity for simplicity!). 
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From the fitting results we can derive two estimates for Nd, tlie number of sources in the 
Galaxy, in terms of the survey parameter, egu- The first relation results from inverting the 
relation e^n = Csky^ch where esky is related to -Dmfp (as in Fig. |TI|) which, in turn, is a function 
of iVo, D^fp oc iV-^/': 

= eski(e-^ch/P.), (55) 

where 'e^y' means the inverse of the function esky(A^D) that relates the number of sources 
A^D to esky! Pu is the probability that a signal is in the band observed by META. Here we 

define this probability as = A^signals/bcamAz/spectrometer/^D- 

The second relation is based on the relationship between A^d and the flux ratio 
S'i/5'o = Ci/Co- We can determine only d and the product (oesu from the likelihood analysis. 
Let the ratio of these fitting parameters be 



Ci 



Inverting the ratio Ci/Co (cf- yields 



(56) 



N^^ < No 



Ar(f) = 1 - Fesu/2N^ (57) 

Figure |l^ shows the two estimates for A^d plotted against for P^, = 0.01. Increasing 
P^ shifts N^^ (solid curve) to the right. Therefore, in order that there be a solution 
defined by the crossing points of the two curves, P^, must exceed a minimum of about 
0.003. This would suggest that the total radio domain of ETI signals is about 300 times 
the META bandwidth if there is only one ETI transmitter per beam. But with more than 
one transmitter per beam, the spectral domain can exceed the META bandwidth by much 
more than this factor. 

For values of P^, that provide solutions, the number of civilizations can be anywhere 
from a few to in excess of 10^°. The reason for this indeterminacy is that, with larger esu, 
Co decreases. This implies, simply, that though there are many more sources for larger egu, 
the vast majority is buried in the noise. Unfortunately, the ambiguity in A^d means we 
cannot establish, through this analysis, the distance scale for putative sources consistent 
with META candidates. Therefore we cannot constrain the transmitter power with the 
available information we have used. 



Horowitz & Sagan point out that the strongest candidates in META cluster about the 
Galactic plane. We can use the spatial distribution to estimate the distance scale and thus 
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provide another constraint on N^. For the uniform disk model of and where sources can 
be detected above threshold out to a distance -Dmax with Hq < -Dmax < Rg, we have 

From the nine strongest META candidates, we estimate (sin^ b) ^ 0.097 compared to a value 
1/3 for an isotropic population, implying D^g,^/ Hq ^ 2.3. The disk subvolume sampled 
by these nine events is Vg = [3(Dmax/-f^G)^ ~ 1] out of a total volume V = 27iHqRq. 

For Rq/Hq = 150, as in §^ we find that V/Vg ~ 10^'^. Thus the nine events found in Vg 
imply the presence of A^d ~ 10^'^ sources in the Galaxy. This number is comparable to the 
cross-over point of the two lines in Figure |T^. 

We conclude that if the nine strongest META events are real celestial sources (natural 
or artificial), there are a few tens of thousands of such sources in the Galaxy. 



10. DUAL STATION & DUAL BEAM OBSERVATIONS 

Here we discuss the joint intensity statistics of SETI made simultaneously at two sites. 
Most of our discussion also applies to multiplying interferometry, such as proposed by 
Welch (1983). Simultaneous SETI observations at two terrestrial locations are analogous 
to pairs of single-site observations made at different times when one considers the level of 
correlation for the interstellar scintillations. 

In Appendix P we discuss the spatial correlation of ISS. The correlation length is 
directly related to the scintillation time because the underlying process is a diffraction 
pattern swept across the line of sight at a transverse speed V±. Referring to Fig. |l|, 
scintillation times for nearby sources are measured in hours while much faster scintillations, 
with time scales of seconds, will be seen from sources across the Galaxy. Transverse 
speeds V± ~ 10 km s~^ then correspond to length scales ranging from tens of km to 
> 10^ km. Dual-site SETI observations will therefore encompass all possible degrees of 
correlation of the scintillation gain. These are (1) perfect correlation for nearby sources; 
(2) partial correlation for intermediate distances; and (3) no correlation for distant sources. 
At 1.4 GHz, sources within 1 kpc will show highly correlated scintillations for any pair 
of terrestrial sites and those at distances greater than 5-8 kpc will display uncorrelated 
gains for sites more than 1000 km apart. We emphasize that the relevant distances for 
demarcating these regimes are frequency dependent. At frequencies higher than 1.4 GHz, 
more sources in the Galaxy will show scintillations that are spatially correlated between 
two sites. 
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Dual-site observations are therefore equivalent, statistically, to single-site reobservations 
made promptly after an initial detection (such that ISS is strongly correlated) and those 
made with substantial time delay, in which case the ISS is uncorrelated. As we saw in §p75 



the probability of detecting an ETI signal in even a prompt reobservation need not be large 
and the probability of redetection in a delayed, uncorrelated reobservation can be negligible. 

If A^triais,iss is large (cf. Eq. |^), a survey is expected to encounter rare, high amplitude 
scintillations, in which case weak sources may be detected at one site as a combination of 
noise and signal with low chances of redetection (either later in time or at the other site). 
This conclusion restates our analysis of META events made in §^ Similarly, small iVtriais.iss 
means that only smallish scintillation gains are to be expected in a survey. In both cases, 
however, rare noise fluctuations play a key role in detections of sources because of the large 
number of noise trials. 



We refer to the analysis of §2]3| on temporal sequences of observations to analyze 
dual-site observations. Figure |^ shows the detection probability of a second observation 
with threshold It given that a first observation was made that yielded an intensity Ji. The 
detection probability is plotted against the correlation coefficient p (which may be either the 
temporal or the spatial correlation). The results for the several values of It and < p < 1 
indicate that the probability of detection at a second site is small, even when p is large. 



10.1. META & META II Two-station Observations 

META and META II devote some observing time to simultaneous observations at 
1.42 GHz and with a survey threshold ?7t = 16 at each site. For detections at threshold 
at one site, i.e., rji = 16, Fig. || provides the detection probability for the other site. Even 
for large intrinsic signal strengths, e.g., ( = 32, the chance of not detecting the source 
at the second site is non-negligible, ~ 35%. If the initial detection was of a weak source, 
e.g., ( < 4:, modulated above threshold by a large scintillation gain or noise fiuctuation 
or both, the probability of detecting the source at the second site can easily be less than 
10%. In accord with our previous discussion, a significantly lower threshold (e.g., 10 (A^), 
cf. Fig. 1^ and §|11]) would make detections at both sites much more probable, albeit with a 



larger background false-alarm rate. Part of the task of subsequent processing would be an 
assessment of whether excess hits above the background rate were statistically significant 



see 
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10.2. Future Two-station or Dual-Beam Observations 

The recommended procedure for dual-site observations is to report observations at low 
thresholds. This necessitates a large amount of storage for results in a dual-site search but 



raises the odds of a dual detection. As discussed in §0, there are additional reasons for 



reporting more measurements than are typically done in SETl. 

In the proposed BETA survey, a billion channel spectrometer will be used in a two-feed 
antenna system so that a given sky position is viewed successively in the two beams (HS93). 
The time delay is of order minutes, so scintillating sources may or may not be correlated 
in the two beam measurements, depending on observation frequency and source distance. 
The odds for dual detections depend on the observing threshold according to our previous 
considerations, with lower thresholds preferred. 



10.3. SETI Interferometry 

Welch (1983) has emphasized the inferential power of multiplying interferometry in 
SETI with respect to localization on the sky and for establishing the celestial nature of 
any real sources by seeing the influence of the expected Doppler shifts. ISS influences 
interferometric observations by attenuating the visibility function by a factor Ps{b), where b 
is the interferometer baseline. 



11. ALTERNATIVE DETECTION METHODS 

The conventional detection method in SETI and other astronomical surveys uses 
a threshold in signal-to-noise ratio (SNR) to define candidate signals. The false-alarm 
probability is often used to define the threshold in terms of the number of statistical trials 
made in the survey. 

Here we advocate an alternative method for SETI that uses more information in the 
data and a priori knowledge about the noise statistics. This method allows lower signal 
levels to be probed than in the SNR method. The scheme, first discussed by Zepka, 
Cordes & Wasserman (1994) for detecting X-ray sources in Poisson backgrounds, looks 
for departures from the expected shape of the intensity (or count-rate) histogram. The 
stopping criterion is that sources are identified iteratively until the histogram of residual 
intensities is consistent with a noise-only histogram. For this reason, we refer to the method 
as "pdfCLEAN." 
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For interference-free and source-free observations, the a priori intensity pdf is given in 
Eq. |B4| with S = 0, and is a one-sided exponential, exp(— 77). The histogram of intensities, 
hk, is the number of counts in the k^^ intensity bin with centroid value r]k- For A'^obs total 
observations, the mean value of the histogram is 

{hk) = A^obsexp(-r7fc). (59) 

The actual number of counts in h^ is a Poisson random variable such that the probability 
of obtaining h^ counts is given by 

p-{hk) (h, \hk 

pih) = ;r • (60) 

hkl 

Departures from the expected exponential shape are identified by seeking histogram bins 
where P{hk) is small, indicating an excess or deficit of counts. The pdfCLEAN method 
therefore makes use of a probability threshold in place of an SNR threshold in the SNR 
method. Once histogram bins are found in this way, they may be "deconstructed" to find 
the original sky positions and frequency channels of the contributing intensities. These may 
then be subjected to clustering tests to see if real celestial signals or terrestrial interference 
is the cause for departure. 

Application of pdfCLEAN requires substantial storage for raw data results because it 
can be used to probe intensities at lower SNR than can the SNR method itself. A discussion 
of actual storage requirements is given below. The payoff, however, is that real sources can 
be identified at low levels and the interference environment can be better understood as 
well. Zepka et al. (1994) show that the number of spurious detections (e.g., due to noise 
fluctuations) is small using pdfCLEAN. 



12. RECOMMENDATIONS FOR FUTURE SETI PROGRAMS 

Several lines of argument that we have presented lead to a primary recommendation for 
future SETI: Signal detections should be reported at much lower signal levels than have been 
used. In programs like META, where a large-scale survey is performed at one threshold with 
subsequent reobservations at a lower threhold, we argue that the reobservation threshold 
should be low enough to rule out our Model II where survey candidates are hypothesized to 
be scintillating but otherwise constant sources. 

The cost of reporting lower signal levels is a larger recording rate and, possibly, 
a problematic level of false alarms. The latter is particularly the case if the candidate 
frequency is expected to vary between survey detection and reobservation, necessitating a 
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large number of frequency channels to be searched during the reobservations. However, in 
§|Tl] we argue that false-alarms need not deter a survey from detecting weak sources. Use of 
the Zepka et al. (1994) pdfCLEAN method allows signals to be extracted from histogram 
counts that contain many noise-only measurements (i.e., false alarms). 

To estimate the threshold to be used for recording spectral amplitudes, we consider 
the technological limits on practical data recording. At present, it is feasible to record 
approximately one high-density magnetic tape per day and expect to analyze it at 
approximately the real-time rate on a network of workstations. This is approximately 10 
Gbytes/day. The data rate is 

Pmax = Tf, , (61) 

and the corresponding threshold is 

iVch/9 



r/T = In 



9.5 + In 




10GB/day\ /10 s 



(62) 



where Tg is the integration time for one spectrum, (3 is the number of bytes recorded per 
threshold crossing or "hit," and /3max is the maximum data recording rate. In Eq. ^ 
we have adopted parameters that typify near-future SETI spectrometers having ~ 10^ 
channels, such as BETA (HS93) and SERENDIP IV ( Powyer et al. 1994|) . The ~ 16 bytes 



recorded per hit would give the sky position, time, frequency channel, signal amphtude, and 
some modest line-shape parameterization (e.g., whether it was unresolved, its width, etc.) 

A semi- quantitative measure of the improvement offered by a lower threshold can be 
seen in Fig. ^ With a threshold of t^t = 10, the detection probability for an event at 
the survey threshold of META, rji = 32, is nearly a factor of 10^ greater than with the 
threshold used. Similarly, dual-site observations employing thresholds of 10 rather than 16, 
e.g., META and META II, would have larger detection probabilities by a factor of ~ 10. 

Whether a threshold as small as estimated is practical depends also on the extent to 
which spectral intensities depart from exponential statistics due to RFI, soft RAM errors, 
or celestial signals. From our considerations in previous sections, however, the hit threshold 
presented here would be more than adequate for a full statistical study of event amplitudes, 
especially if followed up with reobservations having a smaller number of channels (by a 
factor of 10^, say) and with a smaller threshold, 7]!^°^ ~ 5. The data rate for reobservations 
would be substantially smaller than in the survey and would therefore not have an impact 
on the overall data rate. Follow-up analysis would include rejection of signals that appear in 
multiple sky positions and a search for signals that repeat in single sky positions (consistent 
with the telescope beam). Application of the Zepka et al. (1994) pdfCLEAN method 
would search for intensity histogram bins which have an excess of counts. In this regard. 
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pdfCLEAN can investigate signals at levels where noise produces a nonnegligible number of 
false-alarm threshold crossings. 

The threshold in Eq. |62| is also adequate for two-station SETI, which can tolerate half 
the threshold of single-station SETI for roughly the same false-alarm rate. 

For intensities below the hit threshold, we advocate computation of a histogram of 
S /N for a generous number of subbands of frequency channels and for a set of individual 
sky positions. This low-intensity histogram provides the means for identifying low-level 
RFI that persists in specific frequency bands and allows redundancy testing for signals that 
repeat in more than one sky position. 

In summary, for future SETI we advocate a survey threshold given by r^T in Eq. ^ and 
where a modest amount of information is stored per hit (e.g., 16 bytes). Signals below the 
survey threshold would be recorded only as a count in a histogram calculated for a given 
subband of frequency channels and a coarse sky position. 



13. SUMMARY AND CONCLUSIONS 

In this paper we have discussed how interstellar scintillations cause intermittency in 
SETI and we have developed a formalism based on both the likelihood function and a 
Bayesian analysis for analyzing existing and future surveys. We emphasize that our results 
are based on the case where interstellar scintillations are saturated, producing modulations 
with a one-sided exponential distribution. This situation applies for distances beyond a 
frequency-dependent minimum distance, approximately a few hundred parsecs at 1 GHz. 
Our conclusions do not apply, quantitatively, for SETI that targets nearby stars using 
centimetric wavelengths, as in Project Phoenix ( [Tarter 1994|) . However, scintillation effects 



from the solar wind and from the stellar wind of the host star of transmitting sites will 
influence search sensitivities. These situations can also be analyzed using the methods of 
this paper. 

In applying our methods to existing surveys we have found that: 



In META ( [Horowitz fc Sagan 1993D , considering each candidate separately, it is 



extremely unlikely that any of the 9 candidates with amplitude > 33a was a mere 
noise fluctuation. A real signal, terrestrial or extraterrestrial, must underly these 
events. However, we are unable to distinguish between a steady signal modulated 
above threshold by scintillations and a transient signal, such as RFI, soft RAM errors, 
or transient ETI signals themselves. 
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• If any of the events in META is a due to a scintillating celestial source, it likely results 
from a modest to large scintillation gain combined with a favorable noise fluctuation. 

• We also show that existing reobservations of META candidate signals (i.e., those 
performed to date) are incapable of ruhng out the case where a real ETI source with 
constant, intrinsic signal strength underlies the measured candidate signal. This 
conclusion holds even for the case where the scintillations remain correlated between 
the time of an initial detection and prompt reobservations. Future reobservations are 
certainly capable of ruling out a constant-source model for META detections. 

• A stronger test of our signal models and of the celestial nature of candidate signals 

requires much lower thresholds and a larger number of reobservations than have been 
performed to date. For META, a reobservation threshold of 5-10 (in signal-to-noise 
ratio) rather than the actual 20 is needed. The number of reobservations needed 
is a function of threshold but is many thousands for the threshold used. In 
future surveys, correlated reobservations made promptly after the initial survey 
detection can drastically lower the total number of reobservations needed to rule out 
constant-strength ETI signal hypotheses. 

• Dual-site observations made simultaneously arc unlikely to yield dual detections of a 
source unless "low" thresholds are used for recording signal levels. 

Having developed these formalisms and applied them to existing surveys, we also 
present recommendations for planning future surveys. An important recommendation 
concerns the philosophy regarding events below threshold. In Horowitz & Sagan (1993), 
nearly all events below threshold were discarded. As a result, in this paper, we have 
treated all such events equally, since the only information reported was that the intensity 
was below threshold. In doing so, we have nevertheless illustrated that, with the 
appropriate combination of scintillations and noise, observed intensities of, say, 32{N) 
can be explained by scintillating sources. If such scintillating sources do exist, then there 
must be many occasions in which the intensity would have been below threshold. We 
recommend the following strategies, based upon the methods described in Zepka, Cordes, 
& Wasserman (1994), for estabhshing the existence of and recovering these signals: 

• A minimalist strategy would retain only those events exceeding a threshold (defined by 
the usual false-alarm probability) , but would form a histogram of those events below 
threshold. Since the expected distribution of the noise is known, viz. a one-sided 
exponential, then deviations from this histogram indicate the presence of non-noise 
signals. Of course, separating RFI from actual ETI signals still remains a problem, 
requiring new observations. 
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• The optimal strategy would retain individual intensity values even for those that are 
far below the false-alarm threshold. Clustering on the sky of results from multiple 
scans of the sky would suggest the locations of extraterrestrial sources that could be 
subjected to intense scrutiny by the flotilla of existing astronomical instrumentation. 

Application of our low-threshold strategy requires archiving of spectral amplitudes at 
large data rates. Our suggestion is that, given the large expense for conducting SETI, the 
data recorded and archived should be of commensurate cost and at a volume dictated by 
hardware available in the current market. 
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APPENDICES 
A. Symbols Used 



Symbol Definition 

I,II,III Denotes Model I, II, or III. 

a The exponent in the log N-\og S distribution of transmitters. 

By) Spectral domain in which ETI signals are transmitted. 

(3 The number of bytes recorded per threshold crossing, or "hit." 

/?niax The maximum data recording rate in bytes s~^. 

C The set of candidate signals found in a survey. 

cro Denotes reobservations with correlated scintillations. 

Dmfp Mean free path for the line of sight intersecting a source. 

Fraction of spectrometer channels containing ETI signals. 
eRFl Fraction of all observations that include terrestrial RFI signals. 

Csky Fraction of telescope beams containing ETI sources. 

•q = I/{N) The intensity measured in units of the mean background noise. 
?7c = The normalized intensity of a candidate. 

r/T = It/{N) The intensity threshold for detection. 
f(r]\I) The posterior pdf for the measured, normalized intensity 

given the background information I. 
f{6\r]I) Posterior pdf for a set of parameters 6 given measurements 

of the normalized intensity r] and background information T. 
/i,ii,iil The posterior pdf for Model I, II, or III. 

ded fg{g) The pdf of the scintillation modulation (or gain), g. 
f2g{gi,92', p) The bivariate pdf for two values of the scintillation gain g\^2 

as a function of the correlation coefficent p. 
fi{I, S) Intensity pdf for a constant source strength. 
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Symbol Definition 

fi,scintil, S) Intensity pdf for a source that varies due to scintillations. 

f2l{Ii, I2] S, p) The bivariatc intensity pdf. 

/rfi(Crfi) The pdf of normalized RFI source strengths. 

fsiS) The pdf for the intrinsic source strength of ETI transmitters. 

fgiO) The a priori pdf for a set of parameters 0. 

fi^iC) Pdf of normalized source strengths based on the spatial 

distribution and intrinsic radiated powers of ETI sources. 

g Scintillation "gain" for a compact source's radio flux. 

gc Scintillation gain applicable to a particular candidate event. 

Hq Half-thickness of the galactic disk. 

I Background information in Bayesian analysis. 

/ Intensity. 

ISS Interstellar scintillation or scattering. 

SI The rms variation in intensity. 

A/ The uncertainty in establishing intensities during a survey. 

Io{x) The modified Bessel function of order 0. 

/cand Intensity of a candidate from a search program. 

/t Intensity threshold in a search program. 

Kj- Number of reobservations of a candidate signal. 

Kj-^c Number of reobservations where the ISS gain is correlated 

with the gain for the original candidate signal. 

iCr,u Number of reobservations where the ISS gain is uncorrelated 

with that for the original candidate detection. 
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Symbol 


Definition 


L 1 5 J \jL I li 


Critical number of correlated reobservations needed to exclude 




the reality of survey candidate 


[■^^r,u] crit 


Critical number of uncorrelated reobservations needed to exclude 




the reality of a survey candidate. 




Number of separate observations of each sky position in a survey. 




Likelihood function for a candidate signal. 




Likelihood function for the candidates from a survey. 


-^non 


Likelihood function for the non-detections in a survey. 


A = log£ 


Log likelihood function for an individual candidate signal. 


A 


Total log likelihood function for an entire survey. 


id 


Characteristic length scale of the diffraction pattern. 


in 


Typical distance between transmitters. 




Maximum and minimum distances of Galactic transmitters. 


M ' 


The set of nondetections in a survey. 


N 


Noise intensity. 


{N) 


Ensemble average noise intensity. 




Number of Galactic transmitters in a telescope beam. 


-^cand 


Number of candidate signals found in a survey. 




Number of frequency channels in a SETI spectrometer. 


TT-D 


Number density of transmitters. 




Number of ETI transmitters in the Galaxy. 


Nd 


Number of sources such that £b = Hq. 


ATVfa 


Number of false alarms expected per frequency channel. 


-^frames 


Number of reference frames assumed for Doppler shifts. 
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Symbol Definition 

Ablings Number of distinet speetral lines transmitted by an ETI source. 

iVjy Number of separate center frequencies searched. 

A^poi Number of polarizations searched. 

-^signals/beam Number of Spectral lines from all ETI sources in a telescope beam. 

ATgky Number of sky positions (telescope beam areas) searched. 

iVsu = Ngi^yNchKs Total number of statistical trials in a survey. 

iVtriais Total number of statistical trials in a survey. 

iVtriais, noise Number of trials of the noise in a survey. 

-^triais,iss Number of trials of independent scintillations in a survey. 

Af(j Characteristic "diffraction" bandwidth for scintillations. 

Afgb Characteristic broadening width for a spectral line. 

^^^spectrometer Total bandwidth of a SETI spectrometer. 

f^b Solid angle of a radio telescope's primary beam. 

V Power radiated by a transmitter, assumed to be radiated isotropically 

[effective isotropic radiated power (EIRP)]. 

P{I) Probability that the intensity exceeds a value /. 

Pg{g) Probability that the scintillation gain exceeds a value g. 

Pd(-^T) S) Detection probability that a source with strength S 

produces an intensity that exceeds a threshold It- 

-Pd,scint(-fT5 S) Detection probability when a source of strength S scintillates. 

P2d{lT\h', S, p) Detection probability in a second observation given an initial detection 

with amplitude /i, a source strength S, and scintillation correlation p. 

Pfa Probability of a false-alarm detection. 

Py Probability that a band of frequencies contains ETI signals, 

pdf Probability density function 
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Symbol Definition 

TZ Set of intensity upper limits in reobservations made on survey candidates. 

r = Cii/Ciii Ratio of likelihood functions for Model II to Model III. 

rg = ero/csu Ratio of the fraction of pointings containing ETI transmitters for 

reobservations and survey. 

Rq Radius of the galactic disk. 

Pp(r) Normalized, temporal correlation function for the scintillation gain g 

function of time lag r. 

Pi{t) Normalized correlation function for the intensity. 

/9jv Normalized correlation function for radiometer noise. 

/9s (t) Normalized spatial correlation function for the scintillation gain g. 

ro Subscript or superscript to denote reobservations of candidates. 

S Intensity of ETI signal without any modification by scintillations. 

Si^2 Minimum & maximum fluxes of a galactic population of standard candles. 

S'rfi Intensity of a terrestrial RFI signal. 

5'i,RFi Maximum intensity of a terrestrial RFI signal. 

^OjEFi Minimum Intensity of a terrestrial RFI signal, 

su Subscript or superscript for a quantity relevant to a survey. 

Atd Characteristic diffraction time scale for scintillations (strong scattering regime). 

Tg Integration time for one spectrum. 

Ob One dimensional beam width (FWHM) of a radio telescope's primary beam, 

uro Subscript or superscript to denote scintillation-uncorrelated reobservations 

Vk Volume in the Galactic disk within a telescope beam. 

V± Perpendicular speed by which the observing geometry to a source changes. 



Symbol Definition 

Source signal strength in units of the mean background noise. 
Minimum and maximum normalized fluxes of a galactic 
population of standard candle transmitters. 
Minimum and maximum normalized RFI flux. 
RFI signal strength in units of the mean background noise. 



Cl,2 

C(0,1),RFI 

Crfi 
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B. Intensity Statistics 

Let the measured intensity (in a single cliannel of a spectrometer, say) be 

/ = |s + np (Bl) 

wliere n is complex, gaussian noise and s is a signal pliasor of fixed amplitude. The mean 
intensity is 

(/) = 5 + (iV) (B2) 
where S = |sp, = |?T,p, and the variance is 

= a% + 2S{N). (B3) 

For unsmoothed noise from a Fourier transform spectrometer, which we assume for the 
entirety of this paper, aN = {N). 

The probability density function (pdf) of / for an intrinsic signal intensity 5* is 

{i + sy 



/,(/;5) = (iV)-%(^^jexp 



{N) 



Uil), (B4) 



where Jq is the modified Bessel function and U{I) is the unit step function ( [Goodman 1984| ). 
With no signal {S = 0), the pdf becomes a one-sided exponential function. The pdf of y/l 
is the Rice distribution (e.g., [Papoulis 1991| ). 

The realistic case, where the signal strength varies due to scintillations, requires that 
S gS in Eq. where g is the appropriate scintillation "gain." Over short time spans 
(much less than a characteristic scintillation time), g = constant and the pdf of the intensity 
is of the form of Eq. However, over long time spans, g varies according to its own pdf, 
fg{g), and the resultant intensity pdf is obtained by integrating over the pdf of g: 

/7,scint(/; S) = Jdg fgig) /,(/; gS). (B5) 

For saturated scintillations, 

fgig)=exp{-g)U{g), (B6) 

yielding 

/7,scint(/; S) = exp (^-^^ U{I), (B7) 
where {!) = S + {N), as before. 



The utility of the two pdf 's in Eqs. and is as follows. In surveys where the dwell 
time per sky position is much less than the scintillation time scale, the apparent source 
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strength, gS, is fixed, and Eq. ^ is the appropriate form of the pdf to use in calculating 
detection probabilities, etc. of that source at that time. In large scale surveys of many 
independent sky positions, candidate detections of real signals will preferentially select large 
values of the product gS and, most likely, will be predisposed toward selecting abnormally 
large values of the scintillation gain g. However, in reobservations of candidate signals 
selected from an initial survey, it is likely that the reobservations will span, in toto, many 
characteristic scintillation times. The scintillation gain for any real source will vary over its 
allowed domain according to the exponential pdf (for strong scattering). In assessing the 
detectability of the source in these multiple reobservations, one must use the second form 
of the intensity pdf, i.e., Eq. p7| . 

Throughout this paper, we assume that the strong scattering regime obtains. For 
completeness, we note that there are two other scattering regimes (LC98): The weak 
scintillation regime is marked by a nearly symmetric scintillation gain pdf with a 
considerably smaller variance and a much longer scintillation time scale (~ 1 day) than the 
strong scattering case. In the transition regime the gain pdf has no simple analytical form 
but, when compared to an exponential distribution, has a much longer tail to large g while 
having a much longer scintillation time (of order the time scale in weak scintillation). At 
1 GHz, the weak scintillation regime extends to approximately 100 pc and the transition 
regime extends to approximately 500 pc. In order that there be sources within these two 
regimes, the number of civilizations in the Galaxy must exceed 10^ and 10'^ for the weak 
and transition regimes, respectively. These numbers follow by simply calculating the iVo 
needed to make the mean distance between civilizations equal to 100 or 500 pc. 



To calculate the probability that the intensity exceeds a specified threshold. It, i.e., a 
"detection," we integrate the appropriate pdf. 



for the case of a non-scintillating signal of strength S* or a single observation of a scintillating 
signal of observed strength S gS, and 



B.l. Signal Detection and False-alarm Probabilities 






(B9) 



for the case of many observations of a scintillating source. 



- 51 - 



For a scintillating source, the detection probability can be calculated exactly and is 

Pd,scint(/T; S) = exp (-jJ^^ . (BIO) 

For a source of observed, fixed strength gS, there exists no closed form expression for 
Eq. P3, however. 



If there is no signal, 5* = 0, the "detection" probability becomes the "false- alarm" 
probability, a measure of how often the intensity will exceed the detection threshold solely 
from noise fluctuations. This false-alarm probability is 

Pfa(/T) = Pd(/T; S = 0) = exp • (Bll) 

By restricting a survey to a certain false-alarm probability, one can specify It; for example, 
a false-alarm probability of 10^^^ corresponds to It = 27.6{N). Detection thresholds 
are set in most surveys so that the "false-alarm" rate yields a small number of spurious 
"detections" over the course of the survey. 



B.2. Second Order Intensity Statistics 



The scintillation gain varies on a characteristic time scale, At^, defined previously. 
Later, we will need to consider the joint statistics of the scintillation gain and the intensity 
at pairs of times. The bivariate pdf for the scintillation gain ( poodman 1984| ) is: 



f2g{gi, 92] p) 



P 



exp 



9i +92 
' 1-p 



'^y/9l92p 



P 



(B12) 



where p = p{t) is the temporal correlation function for the gain g, normalized as p(0) = 1. 
This function is roughly gaussian in form and has a width that is equal to the characteristic 
scintillation time of the source. The scintillation time, as remarked in § p.2| , is strongly 
frequency, direction, and distance dependent but can be modeled fairly well (cf. Fig. |I|). 
For reobservations made promptly such that r ^ Aid, P ^ 1 and f2g tends to a one 
sided exponential function multiplied by 5{9i — (72)- For long times between observations 
such that scintillations are independent, p — > and becomes the product of the two 
individual pdf's for gi and g2. 



From the bivariate pdf for g, we write the bivariate pdf for the intensity as 



/2/(/i, h\ S,p) = J J dgi dg2 /2g(fi'i, 92\ p)fi{h\ 9iS)fi{l2; g2S). (B13) 
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As with the scintillation gain, the intensities measured at two times are more highly 
correlated for time separations small compared to the scintillation time; conversely, they 
are statistically independent if the separations exceed the scintillation time. The degree of 
correlation, even at short time separations, need not be large, however, because this also 
depends on the signal to noise ratio. We write the intensity correlation function as 

{dl{t)51{t + r)) 



P'^^^ ^ {SP{t)) 
where we define 61 as the deviation from the mean noise level 



5I(t) = I{t) - (N) = g(t)S + 5N + 2^g{t)Sn{n). 
Defining the noise correlation function as 

aj,pr,{r) = {SN{t)SN{t + r)) 

we obtain, for lags r <S At^, 

-g{t)g{t + t)S' + {NfpN{T)[l + 2Sg{t)/{N)] 



(B14) 



(B15) 



(B16) 



[g(t)S+{N)]^ 

1. At a lag r = 0"^ such that the noise has decorrelated 



(B17) 



Note that p/(0) = pjv(O) 
completely (i.e., at a lag comparable to the time needed to calculate a single FFT in a 
digital Fourier-transform spectrometer), the intensity correlation is 

\2 



{gS/{N)r gS<^{N); 



P7(0+) <^ 1/4 



(B18) 



[1 gS:^{N). 

Thus, even though the scintillation gain may be completely correlated over an observation 
interval, the intensity need not be. Moreover, at rare times, the noise itself can produce a 
large spike that decorrelates on a short time scale. 

We find that large intensity excursions receive contributions from both noise and 
scintillation fluctuations. Even if the scintillations are highly correlated at two times, the 
noise will not be, and the intensities need not be. This property has important consequences 
for our abihty to confirm weak, candidate signals as being real ETI sources. 

It is useful to define a conditional detection probability for a measurement at a time 
At after an initial detection: 

P2d{lT\h;S,p[At]) = P{l2>lT\h;S,p[At])= / dl2f2i{Iul2;S,p[At]). (B19) 
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B.3. Intensities at Spatially Separated Sites 

Here we consider joint intensity statistics at two spatial locations for the purpose 
of analyzing SETl programs that involve simulataneous, dual station observations. Well 
separated sites provide a powerful means for rejecting RFI or instrumental effects that are 
peculiar to an individual site (Welch 1983). However, scintillations and system noise are 
also different, in general, at separated sites. 

The intensities measured at two sites at locations Xi^2 will both be of the form of 
Eq. IBT] with statistics as described in Eq. [E2l-[B7[ We must ask how the different elements 



of the intensity are correlated between the two locations. 

The noise is uncorrelated, except for that resulting from any compact sources in 
the beams of the two telescopes that would be detected if the two antennas were used 
as an interferometer. At centimetric frequencies and for typical system temperatures, 
any correlated flux typically amounts to a very small fraction of the total system noise. 
Henceforth, we ignore any such correlated noise. 

The scintillation gain g is generally not 100% correlated between two sites. In 



terms of the normalized, temporal correlation described in § p.2| , the spatial correlation is 
Ps(^ = V±'t) = p(^/Vj_), and where the characteristic width of of ps ranges between tens of 
km for a strongly scattered source and > 10^ km for a weakly scattered source observed at a 
frequency u = 1 GHz. The spatial correlation length increases with frequency as oc z/^'^. 

For an intrinsic source strength, S, the modulated source strength at the i^^ site at 
time ta and frequency is 

Sal3i = g{Xi,ta,l'p)S{ta,l'f3). (B20) 

Between two sites, the signal is partially correlated, according to the value of ps{t). The 
noise at the i^^ site is 

No^pi = N{x,,t^,up) (B21) 
and is uncorrelated between two different sites { a ^ a' , (3 ^ (3' , i ^ j) 

(No^piNo^'P'j) = {N{xi,to„ up)){N{x^,ta, z//?)). (B22) 

Generally, the cross-correlation of the signal portion of the intensity involves a 
multidimensional lag involving spatial separation, time lag, and frequency separation. We 
are interested in narrowband signals measured at identical times and we will consider the 
signals to be intrinsically constant (in time) and deterministic. For this case, if the only 
variation in signal between two sites is the spatial difference of the scintillation gain, then: 

{i9S)af3ii9S)af3j) = {g{xi,t^,vp)g{xj,ta,i'p))S'^{i'p) = [l + ps(|^i - Xj\)]S'^{iyf3). (B23) 
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In practice, there is a Doppler shift of the signal between the two sites that must be 
accounted for and which provides useful means for assessing the celestial nature of a 
source. But as long as the Doppler shift is less than the characteristic bandwidth of the 
scintillations, Eq. B23 still applies. Larger Doppler shifts can be described by a similar 
equation, but with an effective ps that is a combination of spatial and frequency correlations. 
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Fig. 1. — Top: Contours of the scintillation time scale as viewed looking down on the plane 
of the Galaxy. The + indicates the Galactic center and the dashed circles have radii of 5 and 
10 kpc. The offset figure shows the assumed locations of the spiral arms. The scintillation 
times refer to an observation frequency of 1.42 GHz and for directions at zero Galactic 
latitude. The scintillation time scales with frequency as u^ '^. Bottom: Contours of the 
scintillation bandwidth at 1.42 GHz. The bandwidth scales with frequency as i/^-^. 

Fig. 2. — Detection probabilities vs normalized source strength S/It for the case where the 
source docs not scintillate (dashed line) and for the case where it does scintillate in the strong 
scattering regime (solid line). 

Fig. 3. — Detection probability for a second detection above a threshold tjt = /x/(iV) given 
an initial detection with intensity rji = Ii/{N). Curves are plotted as a function of pg, 
which may be either the spatial or temporal correlation function of the scintillation gain. 
Curves are labelled with the intrinsic source strength ( — S/{N), i.e., the source strength 
in the absence of scintillations, a) rji — 32, rj-p — 20, typical META parameters; b) rji — 32, 
r)T — 10, recommended rcobscrvation threshold for future META-like surveys; c) rji — 16, 
r/T = 16, typical dual-site META-META 11 parameters; d) rji — 10, rjT — 10, recommended 
thresholds for future dual-site observations. 

Fig. 4. — Representative time series for the three models for META events; the dashed 
lines indicate a detection threshold: a) Noise only, b) A scintillating source with amplitude 
gS — 5{N) combined with noise. It is assumed that the scintillation gain is constant over 
the time series, c) A scintillating source with time- varying scintillation gain, g{t), combined 
with noise, d) RFI pulse combined with noise. 

Fig. 5. — The joint a posteriori pdf for the scintillation gain Qc and intrinsic signal strength 
( = S/{N). Contours arc at 95%, 90%, 50%, 10%, 1%, and 0.1% of the peak. The number 
of rcobscrvations is assumed to be 256. a) For a candidate with rj^ = 32 and a rcobscrvation 
threshold of 7;^°^ = 20, typical of the META survey, b) For r/c = 100 and 7]!^°^ = 20. c) For 
r/c = 32 and 7]^°^ = 10. d) For r]^ = 100 and r?f°^ = 10. 

Fig. 6. — Best fit values for g^, C) ^-nd their product gcC for an initial candidate detection 
plotted against the number of reobservations. It is assumed that the scintillation gain g is 
uncorrelated between reobservations and between any rcobscrvation and the original survey 
detection. Results are shown for a survey amplitude rjc = 32 and for three values of the 
rcobscrvation threshold, rj!^°^ — 5, 15, and 20. a) ^ vs. Kj.^^. b) g^ vs. i^^r.u- c) gcC, vs. i^^r.u- 
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Fig. 7. — [Top] The critical number of reobservations [Kr.ujcrit plotted against reobservation 
threshold r/x""*- [-^r,u]crit is the number of reobservations needed for a given reobservation 
threshold 7]!^"^ in order to render Model II highly improbable. The curves are labelled by the 
intensity of the candidate signal, rjc, at the initial detection. [Bottom] The number of false 
alarm detections expected (per frequency channel) in [-R'r^Jcrit reobservations. The dashed 
line designates one false alarm per frequency channel, an unacceptably large number. 



Fig. 8. — Similar to Fig. |^, but where correlated reobservations, K^-^c, are included in the 
analysis, a) Candidate intensity rjc = 32 and reobservation threshold 1]!^°^ = 20. b) rjc = 32 
and rj!^"^ = 10. c) rjc = 32 and rj^"^ = 5. 



Fig. 9. — As for Fig. |^ but for correlated reobservations [-ft'r,c]crit- 



Fig. 10. — Contours of the logarithm of the survey likelihood functions for the META 
candidates. Contours are at 95%, 90%, 50%, 10%, 1%, and 0.1% of the peak in each panel. 
a) The survey likelihood function, including reobservations, of the four META candidates at 
1420 MHz assuming Model II, namely a population of standard candle transmitters in the 
Galactic disk which are modulated by scintillation, b) As for (a), but for the seven 2840 MHz 
candidates, c) As for (a), but for Model HI, namely RFI. d) As for (6), but for Model HI, 
namely RFI. 



Fig. 11. — The fraction of the sky having ETI sources in the telescope beam, esky, as a 
function of the mean free path for a beam to intersect an ETI source. We assume a uniform 
galactic disk of radius 15 kpc and thickness 0.2 kpc. 



Fig. 12. — The number of sources A^d plotted against survey fraction esu- The solid curve 
is the estimate for A^d obtained from Eq. ^ while the dashed line results from Eq. 57. 
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Table 1. SETI PROGRAM PARAMETERS 



Parameter 


META 


META H 


SERENDIP HI 




2 


1 


5 


A-frames 


3 


3 


1 




223 


223 


222 


A^poi 


2 


2 


1 


A^sky 


105.2 


105.0 


105.8 




4 


1 


2.3 (ave) 


A^trials 


1013.8 


1013.2 


1013.9 


Beam size (deg) 


0.5 (1.42 GHz) 


0.5 (1.42 GHz) 


0.15 (0.42 GHz) 




0.25 (2.84 GHz) 






Az/ch (Hz) 


0.05 


0.05 


0.6 


Az/totai (MHz) 


0.4 


0.4 


12 


dwell time (s) 


20 


20 


1.7 


Thresholds: 








survey 


31.7(A^) 


24 (iV) 


15(A^) 


reobserve 


20(A^) 






Sensitivities: (EIRP = effective isotropic radiation power.) 




EIRP (W m-2) 


10-22.8 




10 ^^-^ 




Table 2. Bayesian Maxima for META 








9c C 






32 20 


6.2 2.6 






100 20 


16.3 3.9 





32 10 < 10-3 < 10-^ 
100 10 22.0 2.0 
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Table 3. Likelihood Maxima for META 



V 


max(A) CoCsu 


Ci 


(MHz) 








Model II: Survey 




1420 


-61.6 10-1^-9 


102.78 


2840 


-107 10-1^-6 


103.54 


Model 11: Survcy+Rcobservations 


1420 


-61.6 10^1^-9 


102.78 


2840 


-107 10-^i-6 


103.54 



Model III: Survey 
1420 -61.3 10-13.6 102.42 

2840 -108 10-13-2 W"^^ 



Model 111: Survcy+Rcobscrvations 
1420 -61.3 10-13-6 102-42 

2840 -108 10-13-2 102-91 
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